Journal of 


Experimental Psychology 


ARTHUR W. MELTON, Editor 


Air Force Personnel and Training Research Center
Lackland Air Force Base
San Antonio, Texas


CONSULTING EDITORS 
Judson S. Brown, State University of Iowa
Cletus J. [...]
[...] Gagné, [...]
W. R. [...], Johns Hopkins University
Frank Geldard, [...]
James [...]
[...] H. Graham, Columbia University
David A. Grant, University of Wisconsin
Lloyd G. Humphreys, AF Personnel and Training Research Center
Arthur L. Irion, Tulane University
[...] Lindsley, University of California [...]
Neal E. Miller, Yale University
Kenneth W. Spence, State University of Iowa
Benton J. Underwood, Northwestern University
Delos D. Wickens, Ohio State University


Lorraine Bouthilet, Managing Editor 





CONTENTS 


The Timing of Signals in Skill: R. Conrad
Transfer from Verbal Pretraining to Motor Performance as a Function of Motor Task Complexity: W. F. Battig
[...] Shifts: R. T. Kelleher
Some Sources of Difficulty in Solving Simple Problems: J. J. Goodnow and T. F. Pettigrew
Effects of Stimulus Similarity on Discrimination Learning: C. C. Spiker
Constant-Sum Judgments of Facial Expression: T. Engen and N. Levy
Perceptual Behavior as Related to Factors of Associative and Drive Strength: J. Engler and J. T. Freeman
[...] Response Similarity: C. E. Noble and H. P. Bahrick
A Failure to Find Second-Order Semantic Generalization: I. Maltzman and L. O. Brooks
[...] Different Task Instructions and Training: I. Maltzman, E. Eisman, L. O. Brooks, and W. M. Smith
[...]
Sampled-Data Tracking: Sampling of the Operator's Output: C. A. Bennett





American Psychological Association 
Vol. 51 No. 6 June 1956 





JOURNAL OF EXPERIMENTAL PSYCHOLOGY 


The Journal of Experimental Psychology is published [...] by the
American Psychological Association, Inc. Subscriptions are $8.00 [...],
or $16.00 annually. Single copies are $1.50. Subscriptions,
address changes, and business communications should be addressed to the
American Psychological Association, Inc., 1333 Sixteenth St. N.W.,
Washington 6, D. C.


This Journal publishes experimental investigations which are
intended to contribute toward the development of psychology as an
experimental science. Studies with normal human subjects are favored over
studies involving abnormal or animal subjects. Studies in applied
experimental or engineering psychology may be accepted if they have
broad implications for experimental and theoretical psychology. Normally,
articles of a length exceeding 15 printed pages cannot be accepted. How- 
ever, an integrated series of studies accomplished simultaneously (e.g., 
most doctoral dissertations) must be presented in a single article, rather than 
in a series of articles. 

Articles are published in the order of their receipt, except in rare
circumstances. Authors are supplied with 50 free offprints without covers. All
of the cost of an author's alterations in galley proof is charged to the author. 
Priority in publication is given to articles whose authors assume the full 
cost of publication. In 1956 the publication cost is $20 per printed page. 
Authors of priority publications receive no free offprints. 


Address all articles submitted for regular or priority publication to the 
editor, Arthur W. Melton, Air Force Personnel and Training Research 
Center, Box 1557, Lackland Air Force Base, San Antonio, Texas. All 
manuscripts should be submitted in duplicate. Original figures are
prepared for publication; duplicate figures may be photographic or
pencil-drawn. Authors are cautioned to retain a copy of the manuscript
to guard against loss in the mail. 

To save printing costs, supplementary material to articles in this Journal
is deposited with the American Documentation Institute. Copies may be
obtained by ordering the Document Number given in the footnote reference
from the Photoduplication Service, Library of Congress, Washington 25, D. C.
Orders must be accompanied by checks or money orders payable to the
Chief, Photoduplication Service, Library of Congress.








Journal of Experimental Psychology

Vol. 51, No. 6    June, 1956


THE TIMING OF SIGNALS IN SKILL ¹


R. CONRAD 


Applied Psychology Research Unit, Cambridge, England 


Some years ago Bartlett (2, 3) called attention to the importance of
timing in skilled behavior. The term was used in its ordinary everyday
sense to refer to the temporal “layout” of a series of responses. Thus
good timing would show in smooth performance, bad timing in irregular,
jerky performance. Although both the descriptions of good and bad
timing, and the assessment of its importance were penetrating, the
concept lacked precision as a scientific term.

Keeping close to the sense in which Bartlett used the term, the author
(5) showed experimentally that in a sensorimotor skill which permitted
some latitude in the moment of initiation of a response, the moment
chosen tended to be that which gave Ss more time for response than they
would have had if their responses had more exactly matched the signal
series. That is, if Signal 1 was seen to be followed closely by
Signal 2, then Response 1 was advanced in time. But if Signal 2 was seen
to be some time after Signal 1 then Response 1 tended to be withheld
until after Signal 1 had occurred. Based on these results, timing was
defined (7) as “creating the most favorable temporal conditions for
response.”

The conditions in which timing occurs in any skill are defined by the
nature of the particular task. The degree of constraint to which the
timing must submit will vary. In unpaced tasks, constraints are slight.
The moment of initiation and the duration of responses are relatively
unrestricted. But in paced tasks S has little or no power to decide
whether or not events requiring action should occur in the environment
or display. He might have freedom to determine when to deal with them,
however, and in many cases he will be free to decide when they should
occur. This can happen in two different ways. In the first, S cannot
affect the time of onset of a signal. The “timetable” is laid down by
conditions which are beyond his control. Skills in which the display
consists of a number of dial indicators are often of this kind. These
might be the control of chemical processes, or the work of aircrew when
the flight characteristics are put in by another person, or those
periods in many competitive games when the ball or the initiative is
with the opponent. This case has been considered experimentally by the
author (5).

In the second case, S can, within definable limits, change the time
interval between successive signals. Usually the limiting factor is that
he cannot change the relative differences between successive intervals,
but he can slow down or speed up the rate of change of the environment
or display as a whole. Though far from being a complete description,
this type of control is an important feature of, for instance, the skill
of car driving, and other skills in which the operator can vary his own
“speed” relative to that of the environment. It is obvious that changing
speed in this way could be a highly effective method of timing as it has
been defined above.

In the present experiment, a task largely of this second kind was used.
A series of signals in a complex sensorimotor task was available to S
who had continuous control over the rate at which they occurred. The
object of the study was first to observe how he utilized this control,
and second to observe whether or not the quality of his performance
changed when he was deprived of this control.

¹ The work reported was sponsored by the Medical Research Council, and
Ss were made available by the Royal Navy. Miss Barbara A. Hille carried
out the testing and the analysis of records, and the author is grateful
to Dr. N. H. Mackworth for reading and commenting on this paper when in
draft form.


METHOD


Apparatus.—The apparatus used was the 
Multi-Dial Display described by Carpenter and 
Conrad (4) and Conrad (6). An eight-dial
4 × 2 display was used in this case. Each dial
of this apparatus carries a visible revolving 
pointer, which stops at clearly indicated marks 
at the 6 o’clock and 12 o'clock positions if S 
makes no response before the pointer reaches 
either mark. When a pointer thus stops, the 
single hand on an 18-in. diameter clock also 
visible to S registers minus one. Further losses 
are registered if a pointer remains stopped, the
rate of loss being proportional to the speed of the
pointer when running. Pointers can be re- 
started by pressing the corresponding spring- 
loaded switch on a control panel which S has 
before him. Restarting a pointer neither gains 
nor loses points. 

In order to gain points on the clock S must 
decide in advance which of the eight pointers 
will next reach the 6 o'clock or 12 o’clock mark, 
press the appropriate switch just before it 
reaches the mark, and hold it down while the 
pointer crosses the mark and so continues re- 
volving without stopping. Each such successful 
response gains one point. Thus a high over-all 
score can be obtained by making many correct 
decisions accompanied by correct action with 
few and brief pointer stoppages. 

The pointers are driven through a chain of 
gear wheels by a single motor. The relative 
pointer speeds are therefore fixed, but the speed 
of each is different from that of the others. The
faster the pointers revolve, the more frequently 
will decisions be required. 

A new feature now added to the Multi-Dial 
Display is that pointer speed is entirely under 
S’s control. He can vary the speed of the main 
motor by turning a potentiometer. This is 
sprung in such a way that the motor (and 
pointers) stops if he releases this control, and 
increases speed linearly as he turns the
potentiometer in a clockwise direction. The S
operates this speed control with his left hand, 
and the panel of switches with his right hand. 

Because the pointers revolve at different rates, 
if a constant motor speed is maintained the 
intervals between successive signals will be 
unequal, and in an effectively nonrepeating 
sequence. Cox and Smith (8) have shown that 
the distribution of this interval approximates to 
that of the negative exponential. This is the 
basic temporal pattern of events in the display. 
But because the speed-control potentiometer is 
almost instantaneous in its effect, its use can
change not only the mean interval between signals
but also individual intervals, and thereby the
entire nature of the distribution.
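
The temporal pattern just described can be illustrated with a short simulation. The Python sketch below is not part of the original apparatus description. It merges several strictly periodic event trains, the situation treated by Cox and Smith (8), and collects the intervals between successive signals. The periods, the random starting phases, and the function names are hypothetical and chosen only for illustration.

```python
import random


def signal_times(periods, phases, duration):
    """Merge the event times of several strictly periodic sources."""
    times = []
    for period, phase in zip(periods, phases):
        k = 0
        while phase + k * period < duration:
            times.append(phase + k * period)
            k += 1
    return sorted(times)


def intervals(times):
    """Intervals between successive signals (zero when two coincide)."""
    return [later - earlier for earlier, later in zip(times, times[1:])]


# Hypothetical seconds between successive signals on each of eight dials.
periods = [7.3, 9.1, 11.7, 13.9, 16.3, 19.1, 23.3, 29.9]
rng = random.Random(0)  # fixed seed so the sketch is repeatable
phases = [rng.uniform(0.0, p) for p in periods]

times = signal_times(periods, phases, duration=600.0)  # a 10-min. run
gaps = intervals(times)
mean_gap = sum(gaps) / len(gaps)
frac_long = sum(g > mean_gap for g in gaps) / len(gaps)
print(f"{len(gaps)} intervals, mean {mean_gap:.2f} sec., "
      f"proportion above the mean {frac_long:.2f}")
```

For a negative exponential distribution the proportion of intervals longer than the mean would be e^-1, about .37, which gives a rough yardstick for `frac_long`. Running the motor at a different constant speed rescales every interval by the same factor, so the mean changes but the shape of the distribution does not.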

The motor speed is recorded by summing its 
revolutions during tests on a numeral revolution 
counter. A multichannel Biro pen event marker
records when a pointer stops, and the moment
at which pointers cross the target mark when a
successful response is made. These events are 
recorded separately for each dial on paper 
moving at 1 cm./sec. 

Procedure.—Eighteen naval ratings served as 
Ss. On each of five days they practiced for 30 
min. They began with only three dials in the 
display and worked up to reasonable proficiency 
with eight dials. The main instruction given was to make as high a
score as possible and Ss were told each other's score. The Ss quickly
learned that driving the pointers too fast tended to lose them more
points than they gained because many stoppages occurred. Driving the
pointers too slowly, on the other hand, resulted in very few stoppages,
but scored points only slowly. The task itself is intrinsically
interesting and the motivation of Ss was exceptionally high.

During these practice days, the sessions were broken up into short
periods of 5 or 10-min. duration. On Day 6 a 10-min. experimental test
run was made. At the end of it, the clock score was noted, and the
average speed in revolutions per minute at which the motor had been
driven was calculated from the figure showing on the revolution counter.
The speed control potentiometer was then switched out of the system. The
Ss were told that the motor would be set automatically to run at a
constant speed, exactly the average of the previous test run. Under this
condition they would have no control over the distribution of the
interval between signals, and in fact all their attention could be given
to the problem of preventing pointers from stopping, again using only
the right hand. The general instruction was to try to beat their score
of the previous test. This condition was not new as it had been
frequently introduced during the practice days. The appropriate speed
was set, which was different for each S, who worked for another 10
minutes.

It may appear odd always to present the two conditions in the same order
to each S. In fact, there is no other possibility in the present case.
The paced speed must be identical to that chosen by S in the self-paced
condition, and must therefore succeed it. This constraint would tend to
add reliability to the results since there would be a slight practice
effect in the second condition.


RESULTS 


The effects of timing on the display. 
The definition of timing adopted im- 
plies an active relationship between 
S and display. More specifically, the 
definition implies that S modifies the 
temporal structure of the task so as 
to make it easier. It is postulated 
that ideal timing would tend to aim 
at changing the time structure of the 
signal series from whatever it in- 


herently was to a series of equally 


SIGNALS IN SKILL 


367 


spaced events; this postulation as- 
sumes that the events are equivalent. 

There are two main reasons why 
in the present case, and in most other 
cases, the ideal aim is unlikely to be 
achieved. First, the 
known variability of human perform- 
ance, and second, because with several 


because of 


concurrent signal sources comprising 
the display there will be a number of 
occasions when two or more sources 
simultane- 
that 


interval 


signals 
ously. It 
with this zero 
cannot be made into anything else. 


present exactly 
indicated 


apparatus a 


has been 


In practice this restriction will extend 
to all very short intervals even though 
they are not infinitely short, so that a 
less ideal but more likely temporal 
structure is one in which the interval 
between events is normally distrib- 
uted. Finally, as a working prin- 
ciple it is suggested that the main 
effect of timing on an equivalent- 
signal series would be to reduce the 
variability of interval size. 

The measure of variability adopted 
was a log transformation of SD, the 
data coming from the measurement of 
the interval between every signal in 
the 10-min. test for each S. For the 
self-paced condition, the mean of the 
SD’s of the 18 Ss was .362. 

The significance of this value can 
only be assessed if it is known what 
the value would have been if Ss had 
been unable to exercise control over 
the signal series. ‘This value is pro- 
vided by the second condition, since 
for each S, the mean size of interval 
was the same as for the first. For 
the second test the mean SD was .434. 
Every S had a lower SD for the self- 
paced than for the paced condition. 
A t test showed that 
was significant at 
.OO1 level. 

The histograms in Fig. 1 compare 
some typical distributions of the 


the difference 


better than the 





R. CONRAD 


CONOITION WITH COMTmm. 


Fic. 1. Examples of distributions of the 
interval between signals under the two test 
conditions. (Distributions are in steps of 1.0 
sec. Final step includes all intervals greater 
than 10.0 sec.) 


interval between signals for the two 
test conditions. For clarity these are 
shown in the measured time units 
rather than in logarithmic units. It 
is clear that the principal effect is that 
Ss try to lengthen very short intervals, 
and to a lesser extent to shorten very 
long time intervals. From the point of 
view of the aim of the skill, there are 
sound reasons for this. For instance, 
changing an interval of say, .5 sec., 
into one of 2.0 sec. means almost 
certain success with two consecutive 
responses, when the probability of 
two failures was high and of one 
failure almost certain. 

By reducing the duration of the 
long intervals that would occur, S
allows himself more opportunity to 
achieve a good score by “feeding” 
himself, in the finite test period, more 
signals that are likely to lead to good 
response. 
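
The variability measure used in this section can be sketched in a few lines of Python. The paper specifies only an SD in logarithmic time units, so the base-10 logarithm, the sample (n - 1) form of the SD, and the skipping of zero intervals are assumptions made here. The two interval series are hypothetical and are not data from the experiment.

```python
import math
import statistics


def log_sd(intervals_sec):
    """SD of log10 intervals; zero intervals have no log and are skipped."""
    logs = [math.log10(x) for x in intervals_sec if x > 0]
    return statistics.stdev(logs)


# Hypothetical intervals (sec.) with the same mean under the two conditions.
paced = [0.5, 1.0, 6.0, 2.0, 9.0, 0.8]
self_paced = [1.7, 2.3, 4.6, 2.9, 5.5, 2.3]

print(f"paced: {log_sd(paced):.3f}, self-paced: {log_sd(self_paced):.3f}")
```

Lengthening the very short intervals and shortening the very long ones, as described above, pulls the log intervals toward their mean and so lowers this figure. A perfectly even series would give zero.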

The effect of timing on the quality of 
performance.—With the equipment 
described, responses can be clearly 
classified into successful and unsuc- 
cessful, or preferably, positive and 
negative responses. Positive re- 
sponses are those which prevent a 
pointer from stopping, negative re- 
sponses are those which merely 
restart stopped pointers. The nu- 


merical difference per unit time be- 
tween positive and negative responses 
is the rate at which S advances his 
score, i.e., a measure of success in
dealing with the display. Perform- 
ance under the two test conditions can 
be compared by using this measure. 
For the first condition when they 
could modify the temporal structure 
of the signal series, the mean score 
for the 18 Ss was 98. For the second 
condition (i.e., the same signal speed) 
the mean score was 83. The difference
between these means is significant
at better than the .001 level.
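
The performance measure and the comparison of the two conditions can be sketched as follows. The paper does not say which form of t test was used. A paired form is assumed here because the same 18 Ss served under both conditions, which would give 17 degrees of freedom. The function names and the scores are hypothetical.

```python
import math
import statistics


def score_rate(positive, negative, minutes):
    """Net points gained per minute: positive minus negative responses."""
    return (positive - negative) / minutes


def paired_t(first, second):
    """t statistic for paired scores (same Ss in both conditions)."""
    diffs = [a - b for a, b in zip(first, second)]
    standard_error = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return statistics.mean(diffs) / standard_error


# Hypothetical net scores for five Ss; not the data of the experiment.
self_paced = [101, 94, 110, 88, 97]
paced = [85, 80, 92, 79, 81]
print(f"t = {paired_t(self_paced, paced):.2f} on {len(paced) - 1} df")
```

The pairing matters because each S's paced speed was set from his own self-paced run, so the natural unit of comparison is the within-S difference.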

There is therefore considerable ad- 
vantage to be gained as a result of 
timing insofar as it reduces temporal 
variability within the signal series. 
This being the case, one would expect 
that the more skillful the timing, i.e., 
the larger the effect on the structure 
of the signal series, the greater should
be the beneficial effect on performance.

Fig. 2. Relationship between change in the temporal structure of the
signal series and change in performance. (Axis labels: Display
Difference, Performance Difference.)

For each S we have a measure of 
interval variability under the two 
conditions, the difference representing 
the effect on the display of S’s control. 
We also have measures of performance 
success for the two conditions. One 
would expect a close positive associ- 
ation between differences in interval 
variability and differences in per- 
formance. The most convenient 
index of performance is that referred 
to—the numerical difference between 
positive and negative responses. The 
larger this difference is, the better, in 
terms of the aim of the task, is 
performance. 

The relationship is shown in Fig. 2. It is evident that Ss who make
large differences to the display as a result of timing, achieve great
improvement in their performance scores. The correlation between these
two variables is .92.


DISCUSSION

It is clear from these results, not only that timing is an important
component of skilled performance, but that it is important insofar as it
enables Ss to re-allocate time within the framework of the task. An S
whose timing is bad is compelled continually to make the best he can of
conditions imposed by the display. But good timing forces the display
into a temporal form which is convenient for S. This form has been shown
to be in the direction of a normal distribution of the interval between
successive signals.

Throughout this paper, the term timing has referred to actual
modifications of the display. This has been an arbitrary restriction of
the term, since it has been previously demonstrated (5) that timing in
the sense defined is equally a component of skills in which the nominal
display cannot be affected by S. In such cases it was shown that S
advances or retards response according to his assessment of objective
and fixed display demands. When, as in the present case, he can change
the display, timing of the discrete key responses is not abandoned.
Hurried jerky responses can still be distinguished from the smoother
apparently languid movements from key to key of the more skilled S. In
the present case the definition and measurement of well- and badly-timed
key responses would be technically difficult. The intention here was to
examine the role of the timing of signals when continuous change of
display and control were characteristic, and whether the cited
definition of timing maintained its value.

In discussing the mathematical nature of the changes effected by timing,
the essential condition of equivalence of signal cannot be overstressed.
One would not expect any equally spaced series of signals to be optimal
if some of the signals were inherently more difficult to deal with. This
is without doubt the reason why Knowles, Garvey, and Newlin (9) were
able to report an advantage for performance in their self-paced compared
with their paced condition. In fact, when Ss could manipulate the
temporal structure of a series of signals of two levels of difficulty,
one might expect that a bimodal distribution of the interval between
them would result.

Finally, the experiment reported is relevant to the discussion of the
interaction of two separate components of a motor skill. Classical
studies and more recent ones (1) have emphasized the role of a second
component as interfering. The present study illustrates a case where the
second component might interfere with the first especially during the
early part of learning. But high levels of performance are only possible
if the second component is integrated into the whole, when it becomes of
positive assistance. This is merely to make the important distinction
between what is truly a component of a motor skill and must logically
contribute towards its aim, and purely extraneous, separate, distracting
tasks which can play no part at all in the furtherance of another skill.


SUMMARY 


In terms of a definition of “timing,” it is 
postulated that in certain skills the effect of 
timing will be to change the temporal structure 
of an approximately random series of signals to 
one in which the interval between signals is more 
normally distributed, and that the effect of this 
change on performance is advantageous. 

The experiment employed a complex sensorimotor task in which a
multidial display presented signals for response at approximately
random time intervals. The 18 male Ss were tested under two conditions.
In the first they could operate a control by means of which the
interval between each pair of signals could be changed. In the second
condition they were deprived of this control. It was arranged that the
display would present signals at a rate exactly equal to that which
each S had selected in the first test, but the inherent temporal
structure of the signal series (approximately random) was left
undisturbed.

The two resulting distributions of the interval 
between signals were compared. It is shown 
that the inherent distribution of the second test 
was changed in the first test in the direction of 
normality, and that the SD (log time) was 
reduced by every S. This change had the effect 
of significantly improving performance, and a 
close positive association is demonstrated be- 
tween the extent of the difference between the 
two SD’s and the extent of the improvement in 
performance score. 



Drawing on these results, the role of timing 
in skill is discussed, a distinction being drawn 
between the timing of continuously graded and 
intermittent response. 


REFERENCES 


1. Baker, K. F., Wylie, R. C., & Gagné, R. M. The effects of an
interfering task on the learning of a complex motor skill. J. exp.
Psychol., 1951, 41, 1-9.

2. Bartlett, F. C. Fatigue following highly skilled work. Proc. Roy.
Soc., 1943, B 131, 247-257.

3. Bartlett, F. C. The measurement of human skill. Brit. med. J., 1947,
4510, 835-838; 4511, 877-880.

4. Carpenter, A., & Conrad, R. An apparatus providing signals at
“random” intervals. Quart. J. exp. Psychol., 1953, 5, 87.

5. Conrad, R. Adaptation to time in a sensorimotor skill. J. exp.
Psychol., 1955, 49, 115-121.

6. Conrad, R. Some effects on performance of changes in perceptual
load. J. exp. Psychol., 1955, 49, 313-322.

7. Conrad, R. Timing. Occup. Psychol., 1955, 29, 3.

8. Cox, D. R., & Smith, W. L. The superposition of several strictly
periodic sequences of events. Biometrika, 1953, 40, 1-11.

9. Knowles, W. B., Garvey, W. D., & Newlin, E. P. The effect of speed
and load on display-control relationships. J. exp. Psychol., 1953, 46,
65-75.


(Received June 7, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


TRANSFER FROM VERBAL PRETRAINING TO MOTOR 
PERFORMANCE AS A FUNCTION OF 
MOTOR TASK COMPLEXITY ¹


WILLIAM F. BATTIG 


University of Wisconsin ²


Several recent experiments have 
been concerned with the effect of 
verbal pretraining on subsequent per- 
formance on a motor task. Since the 
results of such studies have not been 
consistent, a next logical step in 
research is to try to identify the 
variables accounting for this incon- 
sistency. 

The purpose of this experiment was 
to determine the effect of the com- 
plexity of the motor task on the nature 
and amount of transfer from verbal 
pretraining. An examination of previous studies indicates that this
variable may account for much of the inconsistency in results.
Performance on simple discrimination-type motor tasks, such as pressing
a switch (1, 7, 9, 11, 13) or moving a lever (5, 14, 16), is usually
facilitated by verbal pretraining. However, with more complex tasks such
as accurate lever-positioning (2) and tracking (4), no evidence for
positive transfer from verbal pretraining has been obtained.

The nature of the verbal pretraining conditions may also be an important
variable. McAllister (14) found that verbal pretraining in which both
the stimuli and responses are relevant to the motor task (relevant S-R)
produces significant positive transfer to motor performance, but verbal
pretraining in which the stimuli are relevant but the responses are
irrelevant (relevant S) shows no positive transfer. The terms relevant
S-R and relevant S correspond to “verbal pretraining” and “stimulus
predifferentiation” respectively, which have been used by other
investigators (1, 7, 11). The present study also provides a check on the
effect of this variable.

¹ This research was carried out while the author was a National Science
Foundation predoctoral fellow, and is based on a dissertation submitted
in partial fulfillment of the requirements for the Ph.D. degree at the
University of Wisconsin. The author wishes to express his appreciation
to Dr. W. J. Brogden, under whose guidance the research was done, for
his many helpful suggestions.

² Now a National Science Foundation postdoctoral fellow at Stanford
University.


METHOD


Apparatus. The finger positioning apparatus used in the present
experiment consisted of three relatively distinct units: (a) response
unit, (b) display panel, and (c) control unit. The apparatus, from S's
point of view, is shown in Fig. 1.

Fig. 1. The apparatus from S's point of view.

The response unit consisted of four pairs of Mallory 7223C three
position spring-return lever-action switches mounted on a rectangular
metal cabinet. The two switches of a pair were mounted in the vertical
dimension, each pair being separated by 1} in., with a 3-in. separation
between the middle two pairs. A Lucite key (} × 1 × 3 in.) covered by
black rubber tape was attached to each switch. The switches were
operated by placing the finger between the two keys of a pair and moving
the finger either up or down. The upper key of a pair had the lowest of
the three positions as its normal resting position, and could be moved
only upward, automatically returning to the normal position when
released. Similarly, the lower key was normally in its highest position
and could be moved only downward. As only one key of a pair could be
moved at a given time, there were five possible positions for each pair,
one with both keys in the normal position, two of the upper key, and two
of the lower key. The two keys of a pair were | in. apart, parallel to
each other and to the table when in normal position. There was a 60°
angle between successive key positions; however, the linear extent of
movement could vary from { in. to 1} in., depending on where S's
fingertip was placed on the key.

The display panel consisted of four double columns of 6-w. lamps, five
in each column. The lights in a column were 14 in. apart, and the two
columns in a double column were separated by 1 in., with 2 in. between
each double column. The column at the left of each double column was
colored red, that at the right was colored green. Each column of green
lights was connected with one pair of response keys. The five green
lights from top to bottom corresponded directly to the five key
positions from top to bottom.

The control unit was operated by E and consisted of switches for
controlling the red lights on the display panel, plus the appropriate
controls for scoring and turning on and off various components of the
apparatus.

The stimulus for the motor task consisted of the lighting of one red
light from each column. The S's task was to light simultaneously each
green light corresponding to the red lights which were lit as quickly as
possible by moving the keys to the appropriate positions with the
fingers. This constituted a match. Self-paced conditions were used so
that after a match had been made and held for approximately 4 sec., a
new pattern of red lights was presented, and S immediately proceeded to
match the new pattern. Performance was scored in terms of number of
matches in a 1-min. trial.

The response unit and display panel were placed at opposite ends of a
6-ft. table. The control unit was on a table behind a partition to S's
right, at which E sat. The S sat in an adjustable chair, so that his
elbow height was approximately that of the response keys.

Subjects. The Ss were 240 University of Wisconsin undergraduate
psychology students, 120 of each sex, who received class points for
their services. They were unsystematically assigned to 12 groups of 20
Ss each, with an equal number of males and females in each group.


Design. The 12 groups formed a 3 × 4 factorial design, with three types
of pretraining conditions and four levels of motor task complexity,
differentiated in terms of the number of fingers used on the task in
four steps from one to four. The one-finger groups used only the index
finger of the right hand on the pair of keys and double column of lights
at the extreme right of the apparatus. The two-finger groups used the
index and third fingers of the right hand on the two key-light units at
the right. The three-finger groups used in addition the index finger of
the left hand, operating all key-light units except the one at the
extreme left. The four-finger groups used the index and third fingers of
both hands on all four key-light units.

TABLE 1

Letters Used in Relevant S Pretraining

Column

N
R
P
M
H
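
The layout of the design can be checked with a short enumeration. The Python sketch below is an illustration only. It lists the 12 cells of the 3 × 4 factorial and counts the red-light patterns available at each complexity level, on the assumption that any one of the five lamps in each column in use may be lit. The paper does not state whether every such pattern was actually presented. The variable names are hypothetical.

```python
from itertools import product

PRETRAINING = ["control", "relevant S", "relevant S-R"]
FINGERS = [1, 2, 3, 4]          # levels of motor task complexity
SS_PER_GROUP = 20
POSITIONS_PER_KEY_PAIR = 5      # two up, one up, center, one down, two down

# One group per combination of pretraining condition and complexity level.
groups = list(product(PRETRAINING, FINGERS))
total_ss = len(groups) * SS_PER_GROUP

# One red light per column in use, five lamps per column (assumed all usable).
patterns = {k: POSITIONS_PER_KEY_PAIR ** k for k in FINGERS}

print(f"{len(groups)} groups, {total_ss} Ss")
for k in FINGERS:
    print(f"{k}-finger task: {patterns[k]} possible red-light patterns")
```

On this count the number of possible patterns grows by a factor of five with each added finger, which is one way of expressing the increase in complexity across the four levels.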

The pretraining conditions were as follows: 
(a) Control: No pretraining was given.* (b) 
Relevant S: The green light columns were 
covered by cards containing letters of the 
alphabet, one letter corresponding to each red 
light. The S’s task was to pronounce aloud 
nonsense words formed from the letters reading 
from left to right which corresponded to the red 
lights of the stimulus patterns. The letters for 
the four-finger complexity are shown in Table 
1. For each successively lower level of com- 
plexity the column of letters at the right was 
eliminated, and the other columns moved one 
position to the right. The second letter in each 
word was always a vowel, making the words 
pronounceable yet relatively meaningless. 
Nothing was said about memorizing either the 
words or letters. (c) Relevant S-R: The green 
lights were turned off, and S’s task was to 
describe verbally the key positions which would 
match the red light patterns, using the terms 
two-up, one-up, center, one-down, or two-down. 

Procedure.—The experimental session was about 50 min. in
duration. Upon entering the experimental room, all Ss were seated
at the apparatus and given standard instructions regarding the
operation of the apparatus and procedure to be followed. The
control groups proceeded immediately to practice on the motor
task. The pretraining given the relevant S and relevant S-R
groups consisted of the presentation of 50 red light patterns at
10-sec. intervals, to which the appropriate verbal response was
made. The relevant S groups were then asked to write down as many
of the letters as they could remember (this was also done at the
close of the experiment), in order to provide a check on any
relationship between learning of the letters and motor
performance. Following the pretraining, a summary of the initial
instructions was given, along with the suggestion that use should
be made of the verbal pretraining, since it should be helpful.

* The control groups were run prior to the verbal pretraining
groups and were given 30 trials of motor practice. Only the data
from the first 20 trials are presented here. The additional data
and further analyses bearing on complexity of the motor task,
along with a justification for the use of the matches score, can
be found in the dissertation on which this report is based (3).

All groups performed under identical conditions on the motor
task, receiving 20 1-min. trials with a 15-sec. rest after each
trial, and a 1-min. rest after every fifth trial. After each
trial Ss were told their scores in terms of number of matches.

A single sequence of 50 stimulus patterns was 
constructed for each level of complexity and used 
repeatedly throughout motor practice. The 50 
pretraining patterns also followed this sequence. 
For each higher level of complexity this sequence 
was identical to that for the next lower level, 
except for the addition of another red light to 
each pattern. This made the sequence identical 
for all groups on the key-light unit at the right 
of the apparatus, and so on. The sequence was 
constructed on the basis of criteria designed to 
provide that each possible combination of finger 
positions appeared with approximately equal 
frequency, and to minimize sequential dependencies.
Fifty different patterns were used
for the three- and four-finger complexities, but
only five were used with one-finger, and 24 for
two-finger complexities (the pattern with both
lights in the center position was not used).
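The pattern counts given above follow from simple enumeration. The sketch below is an illustration only: it assumes that each key has exactly the five positions named in the relevant S-R instructions, and the function name `possible_patterns` is hypothetical.

```python
from itertools import product

# The five key positions named in the relevant S-R pretraining instructions.
POSITIONS = ("two-up", "one-up", "center", "one-down", "two-down")


def possible_patterns(n_fingers, exclude_all_center=False):
    """Return every combination of key positions for n_fingers keys."""
    patterns = list(product(POSITIONS, repeat=n_fingers))
    if exclude_all_center:
        # Drop the single pattern in which every light is in the center.
        patterns = [p for p in patterns if any(pos != "center" for pos in p)]
    return patterns


counts = {
    1: len(possible_patterns(1)),
    2: len(possible_patterns(2, exclude_all_center=True)),
    3: len(possible_patterns(3)),
    4: len(possible_patterns(4)),
}
print(counts)  # {1: 5, 2: 24, 3: 125, 4: 625}
```

With three or four fingers the number of possible patterns (125 and 625) far exceeds the 50 that were used, which is why a sequence of 50 different patterns could be drawn for those complexities.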


Results


Since analysis of variance revealed 
no significant sex difference (F < 1), 
the two sexes were pooled in analysis 
of the results. The mean number of 
matches for each group on each trial 
is presented in Fig. 2. 

An application of Bartlett's test (8) on the total match scores
gave a χ² of 39.50 (p < .001), causing rejection of the hypothesis
of homogeneity of variance. The source of heterogeneity was such
that no rational transformation was applicable. To compensate for
the increase in Type I errors which may result from this violation
of the assumptions underlying the analysis of variance, the 1%
level of significance was adopted instead of the normal 5% level.

Fig. 2. Mean number of matches per trial (matches score) for each
of 12 groups. Breaks in curves signify longer 1-min. rest interval.
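The homogeneity check reported above can be sketched with the textbook form of Bartlett's statistic. This is a minimal illustration, not the computation used in the report: the function name `bartlett_statistic` and the sample data are hypothetical, and every group is assumed to have a nonzero variance.

```python
import math
from statistics import variance


def bartlett_statistic(groups):
    """Return (chi-square statistic, degrees of freedom) for Bartlett's test."""
    k = len(groups)
    sizes = [len(g) for g in groups]
    variances = [variance(g) for g in groups]  # unbiased sample variances
    n_total = sum(sizes)
    # Pooled variance, weighted by each group's degrees of freedom.
    pooled = sum((n - 1) * v for n, v in zip(sizes, variances)) / (n_total - k)
    numerator = (n_total - k) * math.log(pooled) - sum(
        (n - 1) * math.log(v) for n, v in zip(sizes, variances)
    )
    correction = 1 + (
        sum(1 / (n - 1) for n in sizes) - 1 / (n_total - k)
    ) / (3 * (k - 1))
    return numerator / correction, k - 1


# Hypothetical scores: same spread in the first case, very different in the second.
equal_spread = [[1, 2, 3, 4], [11, 12, 13, 14], [5, 6, 7, 8]]
unequal_spread = [[1, 2, 3, 4], [10, 20, 30, 40]]
stat_equal, df_equal = bartlett_statistic(equal_spread)
stat_unequal, df_unequal = bartlett_statistic(unequal_spread)
```

The statistic is referred to a chi-square distribution with k - 1 degrees of freedom, so a comparison of 12 groups would use 11.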

All the data of Fig. 2 were included in a single analysis of
variance, presented in Table 2, which showed all main effects and
interactions to be significant. The term of primary interest is
the complexity by pretraining (C X P) interaction, which indicates
that the effect of the pretraining conditions varies with the
complexity of the motor task. Figure 2 shows the source of this
interaction: verbal pretraining was facilitative for the one- and
two-finger, but not for the three- and four-finger complexities.

Separate analyses of variance on 
the three pretraining conditions within 
each level of complexity were carried 
out to further specify the nature of 
this interaction. Pretraining was sig- 
nificant for one finger (F = 6.63) and 
two fingers (F = 7.70), but not for 
three fingers (F = 2.14) and four 
fingers (F = 0.19). Duncan range
tests (6) showed that both verbal
pretraining conditions were superior
to the control group for one finger,
but there was no significant difference
between relevant S and relevant S-R
pretraining. For two fingers, relevant
S-R pretraining was significantly
superior to both relevant S
and control conditions, which did
not differ significantly from each
other.


As is obvious from Fig. 2, there is a 
significant difference in motor per- 
formance between levels of com- 
plexity, accounting for the significant 
complexity (C) term in Table 2. 
Duncan range tests showed that each 
level of complexity differed signifi- 
cantly from each other level. The 
significant pretraining (P) term indi- 
cates a difference in the over-all 
effects of the pretraining conditions 
in this experiment. Relevant S-R 
pretraining was significantly superior 
to both relevant S and control conditions,
but relevant S pretraining
did not differ from the control groups, 
according to Duncan range tests. 

The remaining terms of Table 2 are concerned with learning
effects. The significant trials (T) term simply shows that there
was over-all improvement in performance with practice, while the
C X T, P X T, and C X P X T interactions indicate that the course
of learning varied as a function of both complexity of the motor
task and type of pretraining, and that the relationship between
course of learning and pretraining varied with the level of
complexity. These interactions are largely attributable to the
fact that the one-finger task was quite simple, so that learning
and approach to asymptotic performance was more rapid than for
the other three complexities. Analysis revealed a significant
P X T interaction only for the one-finger complexity, indicating
that differences between pretraining conditions diminish with
practice for this complexity, but not for the three more
difficult levels of complexity.

TABLE 2
Summary of Analysis of Variance of Match Scores

Source              df
Complexity (C)
Pretraining (P)
C X P
Ss/Groups           228
Trials (T)
C X P X T
Residual

* Significant at .01 level.
[Remaining table entries are not recoverable from the source.]
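The degrees of freedom for an analysis of this form can be worked out from the design alone. The sketch below uses the standard partition for a groups-by-trials design with Ss nested in groups; the function name `mixed_design_df` is hypothetical, and only the Ss/Groups value of 228 is confirmed by the text.

```python
def mixed_design_df(levels_c, levels_p, n_per_group, n_trials):
    """Degrees of freedom for a C x P between-Ss design with repeated trials."""
    n_groups = levels_c * levels_p
    df = {
        "C": levels_c - 1,
        "P": levels_p - 1,
        "C x P": (levels_c - 1) * (levels_p - 1),
        "Ss/Groups": n_groups * (n_per_group - 1),
        "T": n_trials - 1,
    }
    # Every within-Ss term is the matching between-Ss term crossed with trials.
    df["C x T"] = df["C"] * df["T"]
    df["P x T"] = df["P"] * df["T"]
    df["C x P x T"] = df["C x P"] * df["T"]
    df["Residual"] = df["Ss/Groups"] * df["T"]
    return df


# Four complexity levels, three pretraining conditions, 20 Ss per group, 20 trials.
design_df = mixed_design_df(levels_c=4, levels_p=3, n_per_group=20, n_trials=20)
total_df = sum(design_df.values())
```

The components sum to one less than the total number of scores (240 Ss on 20 trials), which is a convenient check on the partition.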


To further specify the nature of 
these variations with practice, sepa- 
rate analyses were carried out on the 
first and last trials. The C X P 
interaction was significant for Trial 1 
(F = 11.63), but not for Trial 20 
(F = 1.67), indicating that the ob- 
tained variation in the effect of verbal 
pretraining with complexity may be 
specific to the early stages of practice. 

There is some question as to the 
equality of the units of measurement 
for the four levels of complexity, since 
a single match represents a progres- 
sively greater amount of performance 
as complexity increases. To adjust 
for this difference, a percentage trans- 
fer measure was computed for each 
verbal pretraining group, using a 
formula suggested by Gagné, Foster, 
and Crowley (12). 


$$\text{Per Cent Transfer} = \frac{T - C_1}{C_2 - C_1} \times 100$$

In this formula, $T$ is the score of the verbal pretraining group,
$C_1$ is the score of the control group at the corresponding stage
of practice, and $C_2$ is the score of the control group on its
last trial. These scores were computed for several stages of
practice, with similar results, and the mean of the first five
trials was selected for presentation as the best estimate of
initial transfer. It should be noted that the validity of these
scores is limited by the absence of a measure of the highest
possible score on the motor task. The final control group scores
were selected as the best available estimate.

Fig. 3. Percentage transfer scores for each group given verbal
pretraining. Complexity in terms of number of fingers used on the
motor task is on the abscissa; mean per cent transfer is on the
ordinate.
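A percentage transfer score of this kind can be sketched as follows. The sketch assumes the ratio form in which the pretrained group's gain over the control group is divided by the control group's remaining room for improvement; the function name `percent_transfer` and the scores are hypothetical, not data from this experiment.

```python
def percent_transfer(pretrained, control_same_stage, control_final):
    """Gain over control as a percentage of control's remaining improvement."""
    room_for_improvement = control_final - control_same_stage
    if room_for_improvement == 0:
        raise ValueError("control scores are equal; transfer is undefined")
    return 100.0 * (pretrained - control_same_stage) / room_for_improvement


# Hypothetical match scores, for illustration only.
halfway = percent_transfer(pretrained=15, control_same_stage=10, control_final=20)
no_gain = percent_transfer(pretrained=10, control_same_stage=10, control_final=20)
print(halfway, no_gain)  # 50.0 0.0
```

Because the denominator grows with the amount the control group still has to learn, the measure expresses groups working at different levels of complexity on a common scale, which is the purpose stated in the text.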

The percentage transfer scores for relevant S and relevant S-R
pretraining for each level of complexity are presented in Fig. 3.
Both curves show a consistent decrease in percentage transfer as
complexity increases, the only reversal coming between three- and
four-finger complexities in the relevant S condition.

Bartlett's test gave a χ² of 8.95 (p > .05), so that the
hypothesis of homogeneity of variance is tenable for the
percentage transfer scores, and the normal 5% level of
significance was used. These data were analyzed by means of
orthogonal polynomials (10), summarized in Table 3. In the
combined analysis, the average of the two curves of Fig. 3 is
shown to have significant linear and quadratic components. The
significance of the C X P interaction shows that the shapes of the
two curves differ significantly. Only the quadratic component of
this interaction is significant, indicating that this shape
difference lies in the quadratic component. Analysis of the two
curves separately reveals that the relevant S-R curve contains
only a significant linear component, but the relevant S curve has
both linear and quadratic components. The reversal between points
3 and 4 on the relevant S curve is not significant by Duncan range
test.

TABLE 3
Analyses of Variance with Orthogonal Polynomials
of Percentage Transfer Scores

Source               df    Mean Square        F

Combined Analysis
Complexity (C)        3      25788.7        52.10**
  Linear                     65238.1       131.79**
  Quadratic                  11575.3        23.38**
  Cubic                        552.6         1.12
Pretraining (P)               4779.7         9.70**
C X P                         1508.1         3.05*
  Linear                        61.2         0.12
  Quadratic                   4448.9         8.99**
  Cubic                         14.0         0.03
Error               152        495.0

Relevant S
Complexity                   15345.0        18.19**
  Linear                     30651.3        36.33**
  Quadratic                  15188.3        18.00**
  Cubic                        195.3         0.23
Error                          843.8

Relevant S-R
Complexity            3      11951.8        13.94**
  Linear                     34648.1        40.41**
  Quadratic                    835.9         0.97
  Cubic                        371.3         0.43
Error                76        857.5

* Significant at .05 level.
** Significant at .01 level.
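The trend analysis described above can be illustrated with the standard orthogonal polynomial coefficients for four equally spaced levels. The group means below are hypothetical and the function names are assumptions made for this sketch; only the coefficients are the usual tabled values.

```python
# Tabled orthogonal polynomial coefficients for four equally spaced levels.
CONTRASTS = {
    "linear": (-3, -1, 1, 3),
    "quadratic": (1, -1, -1, 1),
    "cubic": (-1, 3, -3, 1),
}


def trend_sums_of_squares(means, n_per_group):
    """Single-df sum of squares for each trend component of four group means."""
    result = {}
    for name, coefs in CONTRASTS.items():
        contrast = sum(c * m for c, m in zip(coefs, means))
        result[name] = n_per_group * contrast ** 2 / sum(c * c for c in coefs)
    return result


def between_groups_ss(means, n_per_group):
    """Ordinary between-groups sum of squares for equal-sized groups."""
    grand_mean = sum(means) / len(means)
    return n_per_group * sum((m - grand_mean) ** 2 for m in means)


# Hypothetical mean percentage transfer at one to four fingers.
means = [60.0, 30.0, 10.0, 15.0]
components = trend_sums_of_squares(means, n_per_group=20)
total_between = between_groups_ss(means, n_per_group=20)
```

Because the three contrasts are mutually orthogonal, their sums of squares add up to the between-groups sum of squares. The same relation can be seen in Table 3, where each Complexity mean square on 3 df is the average of its three single-df components.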




In the relevant S pretraining, Ss 
were asked to write down the letters 
used during pretraining on two oc- 
casions. These responses were scored 
numerically and correlated with motor 
performance, yielding correlations 
ranging from .44 to −.11 for the
various groups. This provides little
or no evidence for memorization of
these letters as a factor in motor
performance. An interesting sidelight
is the fact that for three of the
four relevant S groups, these memory 
scores were higher after motor prac- 
tice than immediately after verbal 
pretraining, although this difference 
is not significant. 


Discussion 


Analysis of the data clearly demon- 
strates that the amount of transfer from 
verbal pretraining decreases as com- 
plexity of the motor task increases, 
according to the present dimension of 
complexity. A suggested explanation is 
based on a distinction between two 
processes involved in any motor task, 
called discovery and performance by 
Melton (15). The discovery process 
involves obtaining some sort of in- 
formation, usually through perception of 
stimuli, and the selection of the motor 
responses to be made on the basis of this 
information. The performance process 
involves the actual execution of the 
motor responses. If it is assumed that 
verbal pretraining affects primarily dis- 
covery, and has little or no effect on 
performance, positive transfer from 
verbal pretraining should increase as the 
relative importance of discovery in- 
creases. Results of previous studies are 
consistent with this hypothesis, as posi- 
tive transfer has usually been shown in 
simple discrimination tasks in which 
discovery is of paramount importance 
(1, 7, 9, 13, 13, 14, 16), but not with 
lever-positioning (2) and tracking (4), 
in which performance is more important. 

In the present study, complexity of both discovery and
performance was varied, but it is probable that the relative
importance of performance became greater with increased
complexity. If this is so, the obtained results are in line with
the above hypothesis. However, it remains to be explained why no
positive transfer was found for three- and four-finger
complexities. In absolute terms, there is more discovery involved
as complexity increases, so if verbal pretraining facilitates
discovery, some positive transfer should have resulted for the
more complex conditions. To completely account for the results,
it must be hypothesized in addition that positive transfer from
verbal pretraining decreases with increased discovery complexity.
This can readily be tested experimentally by increasing the
number of S-R pairs on the Gagné discrimination task (11) or the
Star Discrimeter (14), which would increase discovery complexity
while holding performance complexity constant.


The results can also be explained by assuming an upper limit on
the number of things that an individual can react to
differentially at a given time, and that increased motor-task
complexity in any dimension increases the number of stimulus
and/or response elements involved in performance. Presumably the
beneficial effect of verbal pretraining lies in the addition of
new cues relevant to the motor task. In performing a simple task,
such cues can be utilized without disruption of other factors.
With a complex task, however, utilization of these added cues
will interfere with other essential elements of the task,
producing a decrement in performance. Under these conditions, S
will perform best by ignoring the verbal cues, so that verbal
pretraining would have no effect.

The significant over-all superiority of relevant S-R to relevant
S pretraining, with no difference between relevant S and no
pretraining, supports the findings of McAllister (14). A
significant difference in the form of the functions relating
percentage transfer to complexity was found between relevant S
and relevant S-R pretraining in the present study. This result
was not anticipated, and has no apparent meaning in terms of the
operations involved in the two verbal training procedures. The
verbal pretraining procedures of this experiment did not require
any paired-associate learning of stimulus-response relationships,
unlike most of the procedures used in previous studies. The
relevant S groups merely read their verbal responses, while the
relevant S-R groups had them established by the instructions, so
that they responded without error from the outset of verbal
practice. The low correlations between motor performance and the
extent to which the relevant S groups memorized the letters used
during pretraining suggest that verbal learning was not an
important determinant of transfer. It might be argued that the
superiority of the relevant S-R groups to the relevant S groups
is attributable to greater paired-associate learning under the
former condition. However, there is no direct evidence for this,
and since paired-associate learning was not necessary in either
procedure, it is unlikely that the obtained difference can be
accounted for in these terms. “Learning how to learn” and
“warm-up” have been suggested as important determinants of the
effect of verbal training (7, 9), and are probably involved here.
However, the significant superiority of relevant S-R to relevant
S pretraining indicates that practice of a relevant verbal
response can be facilitative beyond any warm-up effects, even
when little verbal learning is involved.

Whether the present results would be 
confirmed with other dimensions of com- 
plexity is a subject for future research. 
If so, the results would have application 
to various motor skill training situations 
in which “on the job training” is impractical
and some substitute method of 
training is sought. It is suggested that 
verbal training of the type used here may 
be of considerable value when the motor 
task is relatively easy, but that the 
value of such training would diminish as 
task complexity is increased. 







SUMMARY 


The effect of complexity of a motor task on 
the amount of transfer from verbal pretraining 
was studied. Twelve groups of 20 Ss each 
formed a 3 X 4 factorial design, with four levels 
of complexity in terms of number of fingers used 
on a finger-positioning task (one to four), and 
three pretraining conditions (relevant S, relevant 
S-R, and control). Relevant S pretraining 
required S to pronounce nonsense words formed 
from letters corresponding to the stimulus lights 
of the motor task. In relevant S-R pretraining, 
S described verbally the correct finger positions. 
Control groups had no pretraining. All groups 
then received 20 1-min. trials on the motor task. 

Analysis of variance showed a significant 
complexity by pretraining interaction, owing to 
the significant superiority of both types of pre- 
training to the control group for one finger, and 
of relevant S-R pretraining for two fingers, while 
pretraining had no significant effect for three 
and four fingers. A percentage transfer measure 
showed a consistent and significant decrease with 
increased complexity for both verbal pretraining 
conditions. Relevant S-R pretraining showed a 
significant over-all superiority to both relevant 
S and control conditions, which did not differ 
from each other. 

It is concluded that the amount of positive 
transfer from verbal pretraining to motor per- 
formance shows a consistent decrease as motor 
task complexity increases in terms of number of 
fingers used on a finger-positioning task. Possible
explanations of the results are presented
and discussed.


REFERENCES 


1. Baker, K. E., & Wylie, R. C. Transfer of verbal training to a
   motor task. J. exp. Psychol., 1950, 40, 632-638.

2. Battig, W. F. The effect of kinesthetic, verbal, and visual
   cues on the acquisition of a lever-positioning skill. J. exp.
   Psychol., 1954, 47, 371-380.

3. Battig, W. F. Transfer from verbal pretraining to motor
   performance as a function of motor task complexity.
   Unpublished doctor's dissertation, Univ. of Wisconsin, 1955.

4. Battig, W. F., Gregg, L. W., Hornseth, J. P., Nagel, E. H., &
   Brogden, W. J. The effect on sighting proficiency of
   instructed practice on stimulus and response components of
   sighting. Unpublished manuscript, Univ. of Wisconsin, 1952.

5. Cantor, J. H. Amount of pretraining as a factor in stimulus
   predifferentiation and performance set. J. exp. Psychol.,
   1955, 50, 180-184.

6. Duncan, D. B. A significance test for differences between
   ranked treatments in an analysis of variance. Virginia J.
   Sci., 1951, 2 (new series), 171-189.

7. Dysinger, D. W. An investigation of stimulus
   pre-differentiation in a choice discrimination problem.
   Unpublished doctor's dissertation, State Univ. of Iowa, 1951.

8. Edwards, A. L. Experimental design in psychological research.
   New York: Rinehart, 1950.

9. Farber, I. E., & Murfin, F. L. Performance set as a factor in
   transfer of training. Paper read at Midwest. Psychol. Ass.,
   Chicago, April, 1951.

10. Fisher, R. A., & Yates, F. Statistical tables for biological,
    agricultural, and medical research. (2nd Ed.) Edinburgh:
    Oliver & Boyd, 1943.

11. Gagné, R. M., & Baker, K. E. Stimulus pre-differentiation as
    a factor in transfer of training. J. exp. Psychol., 1950, 40,
    439-451.

12. Gagné, R. M., Foster, H., & Crowley, M. E. The measurement of
    transfer of training. Psychol. Bull., 1948, 45, 97-130.

13. Goss, A. E. Transfer as a function of type and amount of
    preliminary experience with task stimuli. J. exp. Psychol.,
    1953, 46, 419-428.

14. McAllister, D. E. The effects of various kinds of relevant
    verbal pre-training on subsequent motor performance. J. exp.
    Psychol., 1953, 46, 329-336.

15. Melton, A. W. Learning. In W. S. Monroe (Ed.), Encyclopedia
    of educational research. (Rev. Ed.) New York: Macmillan,
    1950. Pp. 668-690.

16. Rossman, I. L., & Goss, A. E. The acquired distinctiveness of
    cues: The role of discriminative verbal responses in
    facilitating the acquisition of discriminative motor
    responses. J. exp. Psychol., 1951, 42, 173-182.


(Received June 9, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


DISCRIMINATION LEARNING AS A FUNCTION OF
REVERSAL AND NONREVERSAL SHIFTS¹


ROGER T. KELLEHER²


New York University 


There are two types of theoretical 
formulation which are applicable to 
the learning of consecutive simul- 
taneous discriminations in which the 
stimuli vary along several dimensions. 
Those which assume a direct associa- 
tion of physical stimuli and overt 
responses will be referred to as single- 
unit S-R theories (e.g., Spence’s con- 
tinuity theory). On the other hand, 
formulations which assume the de- 
velopment of implicit or mediating 
responses with their accompanying 
response-produced cues will be re- 
ferred to as sequential S-R theories 
(e.g., Lawrence’s acquired distinc- 
tiveness of cues). The present study 
will assess the relative merits of the 
sequential and single-unit S-R theories 
by determining the effects of reversal 
and nonreversal shifts on discrimina- 
tion learning in the white rat. 

In the reversal shift, the reward values of the relevant cues
during training (first discrimination) are reversed during
testing (second discrimination). In the nonreversal shift, one
stimulus dimension is relevant during training and another
dimension is relevant during testing. It should be noted that the
response of approaching the positive stimulus of the training
discrimination receives adventitious partial reinforcement
following a nonreversal shift. The single-unit theory (9) is not
sufficiently developed to predict which type of shift should be
accomplished more readily under these conditions; however, it
does imply that both should yield negative transfer. If the
partial reinforcement were eliminated, the single-unit S-R theory
indicates that the nonreversal shift would be more rapid, and
that only the reversal shift would yield negative transfer.
According to the sequential S-R theory, the differential
reinforcement of the relevant cues during training generates
“mediating responses” which give rise to a “modified stimulus
pattern.”³ It is assumed that new instrumental behavior is more
readily associated with this “modified stimulus pattern.” Thus
the reversal shift, which involves the same relevant cues on each
discrimination, should lead to positive transfer. This prediction
clearly follows from Lawrence's analysis of his “reversal test”
(5, p. 185). Conversely, the nonreversal shift, which involves
different relevant cues on each discrimination, should lead to
negative transfer. It might be argued that the use of the same
instrumental response for both training and testing violates one
of Lawrence's essential conditions; however, the second and third
stages of his second experiment (5) both involve simultaneous
discriminations. The only critical condition for the
establishment of mediating responses is the differential
reinforcement of the relevant cues over an adequate training
period.

¹ This paper is based on a dissertation submitted to the faculty
of the Graduate School of New York University. The author wishes
to express his gratitude to Professor Howard Kendler to whom he
is indebted for advice and encouragement in the planning and
execution of this investigation.

² Now at the Yerkes Laboratories of Primate Biology.

³ These responses should not be confused with receptor orienting
responses (10) which have been manipulated in other situations
(2, 11). Lawrence (4, p. 782) conceives of these mediating
responses and stimuli as theoretical constructs.

The first experiment of the present 
investigation will be a straightforward 
comparison of reversal and nonre- 
versal shifts. The second experiment 
will embody a procedure which elim- 
inates the partial reinforcement fol- 
lowing a nonreversal shift. Both 
experiments include control groups 
which will make it possible to deter- 
mine direction of transfer. 


Experiment I


The plan of this experiment is to 
train two experimental groups on a 
simultaneous discrimination which 
involves two stimulus dimensions.
After each S has learned the training
discrimination, it is shifted to a
simultaneous test discrimination
involving the same stimulus dimensions.
For Group R, the relevant training 
cues are still relevant, but the re- 
warded test response is the opposite of 
the rewarded training response. For 
Group NR, the irrelevant training 
cues become the relevant test cues. 
Animals of Group C are trained on a 
different simultaneous discrimination 
involving a single stimulus dimension, 
but are tested on the same discrimina- 
tions as the experimental groups. 

The sequential S-R theory predicts 
that Group R will show positive 
transfer and Group NR will show 
negative transfer. The single-unit 
theory is not sufficiently developed to 
predict which type of shift should be 
accomplished more readily. This 
theory does predict that negative 
transfer will result from both types 
of shift. 




Method 


Subjects.—The Ss were 80 experimentally 
naive, male albino rats of the Wistar strain. At 
the beginning of the experiment they were 80- 
120 days of age. Eight Ss were eliminated. 
Three could not learn the first discrimination, 
three would not perform, and two were lost 
through E's error. 

Apparatus.—The discrimination apparatus 
was constructed of 4-in. plywood. The stem 
was 8 in. long, 4 in. wide, and 5 in. high. Its 
floor was 1 in. higher than that of the stimulus 
alleys. The entire stem was painted a flat gray. 
The S had to step over a 44-in. air gap, which 
could be narrowed for adaptation training, to a 
1 X 2-in. gray strip centered in front of two 
adjacent stimulus alleys. These alleys were 
each 15 in. long, 34 in. wide, and 6 in. high. 
Two stimulus boxes were constructed in such a 
manner that black and white alleys could be 
presented in a right-left or left-right order. The 
presence or absence of chain curtains in either 
alley served as a second stimulus dimension 
(cf. 5). Two other stimulus boxes contained 
gray alleys and unpainted alleys, respectively. 
A 1-in. hurdle constructed of }-in. plywood 
could be placed 2 in. from the entrance of either 
of the unpainted alleys. Behind a wooden 
baffle at the end of each stimulus alley was a 
small block of wood with a depression in the top 
surface for food pellets. The entire apparatus 
was covered by clear glass and indirectly 
illuminated by a 75-w. bulb located 36 in. above 
the air gap. 

Procedure.—A feeding schedule was instituted
1 wk. before the start of the experiment
and was maintained throughout. All Ss had
access to a wet mash for 1 hr. per day, and this
was presented 15-20 min. after the completion 
of the daily trials. On Day 1 of the experiment, 
all Ss were allowed 7-10 min. to explore the 
stem and gray alleys with the air gap closed. 
Five .065-gm. pellets were present in each food 
block during exploration. On Days 2 and 3, 
Ss received eight forced trials to each of the 
possible combinations of black-white and chains- 
no chains in each spatial position. Each trial 
was rewarded by a single food pellet. The air 
gap was gradually increased to its full width 
during this pretraining. If S had not left the 
stem by the end of a 2-min. period, it was forced 
by moving the rear wall of the stem forward. 

For training, Ss were randomly assigned to 
one of three groups. The 24 Ss of the control 
group learned the hurdle discrimination in the 
unpainted alleys. Of the experimental Ss, 24 
learned the black-white discrimination (with 
chains-no chains irrelevant), and 24 learned 
the chains-no chains discrimination (with
black-white irrelevant). Each cue appeared an equal number of
times on the right and on the left, and no combination of two
cues, such as chains and black, appeared together on more than
two consecutive trials. On every day of training and testing,
each S received 10 noncorrection trials with at least 5 min.
between trials. The criterion of learning was 18 correct out of
20 consecutive trials. Animals that did not learn the first
discrimination in 30 days (300 trials) were eliminated.
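The learning criterion and the day-based score can be made concrete with a short sketch. It rests on assumptions the text does not settle: the 20-trial window is allowed to slide across day boundaries, the criterion day is the day containing the last trial of the first qualifying window, and an animal that never reaches criterion receives the 30-day maximum. The function names are hypothetical.

```python
import math


def trials_to_criterion(outcomes, window=20, required=18):
    """1-based index of the trial on which criterion is first met, else None."""
    for end in range(window, len(outcomes) + 1):
        # Count correct responses in the most recent `window` trials.
        if sum(outcomes[end - window:end]) >= required:
            return end
    return None


def criterion_score(outcomes, trials_per_day=10, max_days=30):
    """Days before the criterion day; max_days if criterion is never reached."""
    end = trials_to_criterion(outcomes[: trials_per_day * max_days])
    if end is None:
        return max_days
    criterion_day = math.ceil(end / trials_per_day)
    return criterion_day - 1


# Hypothetical record: 30 errors followed by 20 correct responses.
record = [False] * 30 + [True] * 20
first_met = trials_to_criterion(record)
score = criterion_score(record)
print(first_met, score)  # 48 4
```

In this record the window ending on trial 48 is the first to contain 18 correct responses, so the criterion day is Day 5 and the score is 4.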


For the test trials, each of the experimental training groups was
divided into Groups R and NR. These groups were equated for speed
of learning on the training discrimination. Equal numbers of Ss
from Group C were assigned to each of the possible positive test
stimuli. Animals that did not learn the second discrimination in
30 days were arbitrarily assigned a criterion score of 30.


Results 
The 


number of days, excluding the 
terion day, required to attain the 
experimental criterion. The mean 
training criterion scores for the 72 Ss 
divided equally among Groups C, R, 
and NR were 5.96, 6.21, and 6.88, 
respectively. The training data were 
subjected to an analysis of variance 
which that the 
variances statistically signifi- 
cant. Thus, the assumption that the 
test groups did not differ prior to the 
test discrimination is justified. 

The were 
skewed, and the subgroup variances 


the 
cri- 


response measure was 


indicated none of 


were 


test criterion scores 
were extremely heterogeneous; there- 
fore, the nonparametric U test (7) 
was used to assess the differences.® 
The results indicate that Group 
NR shifted faster than Group R 
(P < .05), and that both of these 
groups were significantly more re- 
tarded than Group C (P < Ol). 


⁴ Detailed descriptions of statistical treatments are presented elsewhere (3) and are available from the author.

⁵ All the probabilities were doubled so that the results are consistent with a two-tailed hypothesis.

TABLE 1

ANALYSIS OF VARIANCE OF PERSISTENCE SCORES

Source                    Mean Square       F

1. Treatments (R-NR)         70.08        10.90*
2. Conditions (BW-CN)        10.08         1.57
3. Interaction                0.08        <1.00
4. Within groups              6.43

* P < .01
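The F ratios in Table 1 can be checked by dividing each mean square by the within-groups mean square. The sketch below assumes that the within-groups entry reads 6.43 (the scan shows the digits without a decimal point); under that reading the printed F values are reproduced.

```python
# Mean squares as read from Table 1; the within-groups value of 6.43 is an
# assumed reading of the scanned digits "643".
mean_squares = {
    "Treatments (R-NR)": 70.08,
    "Conditions (BW-CN)": 10.08,
    "Interaction": 0.08,
}
ms_within = 6.43

f_ratios = {source: ms / ms_within for source, ms in mean_squares.items()}
for source, f in f_ratios.items():
    print(f"{source}: F = {f:.2f}")
```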


The test discrimination was learned more readily when black-white was relevant than when chains-no chains was relevant (P < .05); however, the differences between Groups C, R, and NR were in the same direction under either stimulus condition.


A persistence score was computed for each S of Groups R and NR. This score was determined by the number of test days before the frequency of the correct training response fell below 80% on any one day.⁶

The analysis of variance presented in Table 1 indicates that the mean score of 4.58 for Group NR is reliably higher than the mean score of 2.16 for Group R. Although the Ss of Group R abandoned the appropriate training responses more readily than Group NR (P < .01), they were slower to learn the test discrimination.
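The persistence score defined above can be written out as a small function. This is a sketch under stated assumptions: the function name and the daily counts are hypothetical, and an S whose training response never drops below the cutoff is given the number of days observed, a convention the article does not specify.

```python
def persistence_score(training_responses_per_day, trials_per_day=10, cutoff=0.80):
    """Number of test days before the training response first falls below cutoff.

    training_responses_per_day: count of trials per test day on which S made
    the response that had been correct in training.
    """
    for days_elapsed, count in enumerate(training_responses_per_day):
        if count / trials_per_day < cutoff:
            return days_elapsed
    # Assumed convention: never fell below the cutoff during the days observed.
    return len(training_responses_per_day)


# Hypothetical S: 10, 9, and 8 training responses, then 7 on the fourth day.
score = persistence_score([10, 9, 8, 7, 3])
print(score)
```

Exactly 80% on a day does not end the count, since the definition requires the frequency to fall below 80%.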


Experiment II 


The plan of this experiment is quite similar to the previous one. Animals of the two experimental groups and the control group undergo the same training and testing procedures as their counterparts in the first experiment. There is, however, an important difference in the stimulus situation which prevails throughout the test trials. For all groups, the irrelevant test cues are presented in successive configurations, i.e., one of the irrelevant cues appears on both sides on each test trial. This minimizes the possibility of differential responses on the basis of these cues; therefore, there should be no adventitious reinforcement for either experimental group.

⁶ The same analysis was carried out on the number of days before the correct training response fell to 50%. The results were entirely consistent with those presented here.

The elimination of the partial 
reinforcement leads to a more critical 
test of the two theoretical formula-
tions. The sequential S-R theory
predicts that Group R should shift 
more rapidly than Group C which, in 
turn, should shift more rapidly than 
Group NR. The single-unit S-R 
theory can now be invoked without 
the necessity of making any assump- 
tions about the effects of partial 
reinforcement. This theory predicts 
that Group NR and Group C should 
not differ significantly, but both 
should shift faster than Group R. 


Method 


Subjects.—The Ss were 79 experimentally
naive, male, albino rats of the Wistar strain.
Seven Ss were eliminated. Three could not
learn the first discrimination, three would not
perform, and one was lost through E’s error.
At the beginning of the experiment Ss were 
80-120 days of age. 

Apparatus.—The discrimination apparatus 
has been described in Exp. I; however, there
were two additional stimulus boxes. While 
metrically identical with the others, one had 
both black alleys and the other had both white 
alleys. 

Procedure.—Exploration, pretraining, and training were identical with the analogous procedures in Exp. I.

The test situation depended upon the cue and the group to which the S was assigned in the first discrimination. For Group NR, the irrelevant training cues became relevant test cues; however, either the formerly positive or the formerly negative training cue was present in both stimulus alleys on any test trial. For example, if S had been shifted from chains-positive to black-positive, the stimulus alleys both contained chains or both contained no chains. For Group R, one of the two irrelevant training cues, which were also irrelevant test cues, was present in both stimulus alleys on any test trial. More specifically, if S had been shifted from chains-positive to no chains-positive, the stimulus alleys were both white or
both black. The number of presentations under
each condition was equalized. It is apparent 
that, during the test trials, the nonreversal Ss 
could no longer maintain their training response 
on the basis of the relevant training cues. 


Results 


The mean training criterion scores 
of the 72 Ss divided equally among 
Groups C, R, and NR were 6.46, 4.62, 
and 4.62, respectively. These train-
ing scores were subsequently trans-
formed to meet the requisite assump- 
tions for the analysis of variance. 
The analysis indicated that there are 
no significant differences among the 
test groups on the first discrimination. 

The mean test criterion scores for 
Groups C, R, and NR were 4.58, 9.16, 
and 4.16, respectively. The test cri- 
terion scores were transformed to 
correct for heterogeneity of variance 
and correlation between means and 
variances. The analysis of the trans- 
formed scores is presented in Table 2. 
The results show that test learning is 
facilitated (P < .05) when the black-
white cues are relevant. The treat-
ments effect is significant at the .01
level, and t tests indicate that Group
R is significantly retarded (P < .01) 
when compared with Groups NR or 
C. The difference between the latter 
groups is not significant. 


TABLE 2

ANALYSIS OF VARIANCE OF TEST CRITERION SCORES TRANSFORMED TO LOG (X + 1)

Source                       df     Mean Square       F

1. Treatments (C-R-NR)        2        0.881         8.90*
2. Conditions (BW-CN)         1        0.618         6.24
3. Interaction                2        0.049        <1.00
4. Within groups             66        0.099
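The reason for transforming the test criterion scores to log (X + 1) can be seen in a small numerical sketch. The scores are hypothetical, and base-10 logarithms are an assumption, since the article does not state the base. When the group with the larger mean also has the larger variance, the transformation pulls the variances much closer together.

```python
import math
from statistics import variance

# Hypothetical criterion scores (days) for a fast and a slow group.
fast = [2, 3, 4, 5]
slow = [5, 10, 20, 30]


def log_transform(scores):
    """Apply log10(X + 1) to each score (base 10 is an assumption)."""
    return [math.log10(x + 1) for x in scores]


raw_ratio = variance(slow) / variance(fast)
log_ratio = variance(log_transform(slow)) / variance(log_transform(fast))
print(f"variance ratio, raw scores: {raw_ratio:.1f}")
print(f"variance ratio, log (X + 1): {log_ratio:.1f}")
```

Adding 1 before taking the logarithm keeps a score of zero days defined.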







Discussion 


According to the sequential S-R for- 
mulation (4, 5) there are two stages of 
discrimination learning. In the first 
stage, mediating responses and cues are 
developed which make the relevant 
stimuli more discriminable. The second 
stage involves the association of instru- 
mental behavior with these mediating 
processes. A reversal shift should be 
facilitated because the same cues are 
relevant in both discriminations; how- 
ever, the nonreversal shift should be 
retarded because different cues are 
relevant in each discrimination. The 
results of both experiments indicate that 
the reversal shift leads to negative, 
rather than positive, transfer. Even 
more damaging to the sequential S-R 
formulation is the finding that the non- 
reversal group shifts more rapidly than 
the reversal group. 

The application of the sequential S-R 
theory to discrimination learning in rats 
finds its main empirical support in the 
work of Lawrence (4, 5). In his first 
experiment (4), the results on the 
successive discrimination (first test) sup- 
ported the sequential theory, but the 
results of the simultaneous discrimina- 
tion (second test) showed a significant 
interaction which prohibited any general 
conclusions. Lawrence’s second experiment (5) involved three stages. In the first stage, Ss learned a successive discrimination involving two cue dimensions (black-white and chains-no chains). One dimension was relevant and the other irrelevant. In the second stage, all Ss learned a simultaneous discrimination which could be mastered on the basis of either one or both sets of cues, i.e., both dimensions were relevant. In the third stage, the sequential hypothesis was tested in three ways: (a) the two cues which had been positive in the second stage were put in opposite alleys (opposition test); (b) half of the Ss in each group relearned the simultaneous discrimination with only one set of cues present (relearning test); and (c) the remaining Ss also had only one set of cues present but the cue values were reversed (reversal test). Lawrence interpreted his results as indicating that the cues which were relevant in the first stage were the most important determiners of the test responses. Even the reversal test was learned more rapidly when these cues were present, and this finding has been cited as being particularly damaging to a single-unit theory (5; 8, p. 453).

Inspection of Lawrence’s data suggests an inconsistency in his interpretation. The Ss that were expected to approach chains or no chains on the opposition test responded at a chance level, and this showed up as a significant interaction chi square. The subgroup means for the relearning and reversal tests reveal that the results were opposed to the sequential theory when the test cues were chains-no chains.⁷ It is possible that the results show no more than a preference for responding on the basis of black-white cues. Only the results of the reversal test on the black-white dimension are consistent with the sequential theory. Lawrence used the “correction-punishment” technique, and it is conceivable that a high response tendency to the incorrect test stimulus could facilitate reversal due to the rapid accumulation of inhibitory potential (cf. 1). A more adequate assessment of this experiment could be made if a control group had been included. This group could have learned a neutral discrimination in the first stage and then proceeded to the same training and testing as the experimental groups.

The results of both experiments in the present study are consistent with the single-unit S-R theory. According to this formulation, the response is directly determined by the excitatory potentials of the various stimulus components. This theory accurately predicted the negative transfer for both types of shift in Exp. I, and the retardation of the reversal shift in Exp. II. For the explanation of discrimination learning in inarticulate organisms, it seems premature to abandon the single-unit S-R theory in favor of the sequential S-R theory.

⁷ This would suggest the possibility of a significant interaction between experimental treatments and stimulus conditions on the relearning and reversal tests; however, the analysis of variance was carried out on transformed scores and showed no interaction. Lindquist (6, p. 216) notes, “Unless the main effect of one of the treatments is nil, the hypothesis of no interaction cannot be true for both the original and transformed measures.” In this case there may be an interaction on Lawrence’s original scale.


SUMMARY 


This study included two experiments which 
were designed to determine the effects of reversal 
and nonreversal shifts upon discrimination 
learning in white rats. In each experiment, two 
experimental groups (reversal and nonreversal) 
of 24 Ss were trained on a simultaneous dis- 
crimination which involved two sets of cues. 
A control group of 24 Ss was trained on a differ- 
ent discrimination involving one set of cues. 

In Exp. I, the test discrimination involved 
the same sets of cues as the training discrimina- 
tions. For Group R, the relevant training cues 
were still relevant, but the cue values were 
reversed. For Group NR, the irrelevant train-
ing cues became the relevant test cues. If
Group NR persisted in their training responses, 
they received fortuitous partial reinforcement. 
The control Ss were tested on the same dis-
criminations as the experimental Ss. In Exp.
II, the partial reinforcement was eliminated by 
having one of the irrelevant test stimuli present 
in both alleys on any test trial. Except for this 
change in the test situations the two experiments 
were identical. 

The results of Exp. I indicated that both 
types of shift result in negative transfer; how- 
ever, the nonreversal shift was accomplished 
more readily than the reversal shift. In Exp. 
II, there was no difference between the test 
performances of the control and nonreversal 
groups, but the reversal group was significantly 
retarded. The results were interpreted as sup-
porting the single-unit S-R theory of Spence,
while being inconsistent with the sequential S-R 
theory of Lawrence. 


REFERENCES 


1. Blum, R. A., & Blum, J. S. Factual issues in the “continuity” controversy. Psychol. Rev., 1949, 56, 33-50.

2. Ehrenfreund, D. An experimental test of the continuity theory of discrimination learning with pattern vision. J. comp. physiol. Psychol., 1948, 41, 408-422.

3. Kelleher, R. T. Discrimination learning as a function of reversal and nonreversal shifts. Unpublished doctor’s dissertation, New York Univ., 1955.

4. Lawrence, D. H. Acquired distinctiveness of cues: I. Transfer between discriminations on the basis of familiarity with the stimulus. J. exp. Psychol., 1949, 39, 770-784.

5. Lawrence, D. H. Acquired distinctiveness of cues: II. Selective association in a constant stimulus situation. J. exp. Psychol., 1950, 40, 175-188.

6. Lindquist, E. F. Design and analysis of experiments in psychology and education. Boston: Houghton Mifflin, 1953.

7. Mann, H. B., & Whitney, D. R. On a test of whether one of two random variables is stochastically larger than the other. Ann. math. Statist., 1947, 18, 50-60.

8. Osgood, C. E. Method and theory in experimental psychology. New York: Oxford Univ. Press, 1953.

9. Spence, K. W. The nature of discrimination learning in animals. Psychol. Rev., 1936, 43, 427-449.

10. Spence, K. W. Continuous versus non-continuous interpretations of discrimination learning. Psychol. Rev., 1940, 47, 271-288.

11. Wyckoff, L. B., Jr. The role of observing responses in discrimination behavior. Psychol. Rev., 1952, 59, 437-442.


(Received May 17, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


SOME SOURCES OF DIFFICULTY IN SOLVING
SIMPLE PROBLEMS¹


JACQUELINE J. GOODNOW² AND THOMAS F. PETTIGREW


Harvard University 


One of the curious features of 
human discrimination learning and 
problem solving is the amazing length 
of time many Ss take to notice that a 
simple and regular pattern of events 
is occurring. Prentice’s experiment 
(16) is a good example. His Ss chose 
between two designs on each trial. 
One design was always a square and 
the other a circle; the size of either 
could be large or small, the position 
on the left or the right, and the color 
of the figure or the background could 
be either black or white. The event 
pattern to be learned was that circle 
was the correct choice when the 
design on the right had a black back- 
ground, and square the correct choice 
when the background was white.
The Ss were allowed to take their 
time in making a choice, but still for 
20 Ss the mean trials for learning was 
41 (SD = 18). And 13 additional Ss 
had not learned at all by the end of 
100 trials. A similar wide range of 
scores, with some extremely high 
scores, has been reported by Gel- 
lerman (3) with a double alternation 
temporal maze, and by Marquart (13) 
with a discrimination pattern comparable to Prentice’s.

What lies behind this slowness in learning a relatively simple event pattern? In all the experimental situations cited, S is required to make a predictive response to each presentation of the stimulus. Our observations in a previous experiment (5) have led us to believe that the core difficulty for pattern learning is the presence of these responses. The S’s attention is directed more toward his own responses and hypotheses, and their successfulness, than toward the events (i.e., toward the alternative which turns out to be correct). This response-hypothesis orientation has a variety of effects. The Ss use their choices as direct tests of specific hypotheses rather than as tools for data gathering with hypothesis testing held in abeyance. As a result, the information gathered is related to a specific hypothesis and, if the latter should prove to be wrong, it is only with difficulty that the information can be transformed and made relevant to another hypothesis. As a rule, Ss do not transform information but start from scratch again with their next hypothesis. Furthermore, Ss frequently become so preoccupied with their level of success and failure that they can pay attention to little else. Finally, and most important, in the course of directly testing one hypothesis after another, the variability of their own responses obscures any event pattern. It is as if one tried to make observations in a scientific inquiry without a standard set of operations.

¹ One of a series of studies comprising the Cognition Project, Laboratory of Social Relations, Harvard University. The authors wish to express their appreciation of advice given by Dr. Jerome S. Bruner and Dr. Ray Hyman, and assistance in running the experimental groups given by Dr. L. Minturn.

² Now at Walter Reed Army Institute of Research.

If this response-hypothesis orien- 
tation is a major difficulty in the 
learning of event patterns, then 
learning should be facilitated by any
experience which leads S to pay
attention to the events rather than 
to his own responses and especially 
by any experience which leads him 
to keep his own responses so consistent 
that change or regularity in the event 
pattern will stand out clearly. It is 
with this hypothesis that the present 
experiment is concerned. 

This hypothesis does not rest en- 
tirely upon our own observations. 
Thorndike (17) and Adams (1) long 
ago noted the advantages of “at- 
tention, often correlated with a lack 
of vigor . . . [with] the absence of a 
fury of activity” for successful per-
formance by cats in puzzle boxes (17,
p. 46). Prentice comments on the 
“random and repetitive nature of 
solutions suggested by those who 
never succeeded in solving the prob- 
lem” (16, p. 189). Heidbreder (6) 


has been most explicit about the pre- 
dominance of participant behavior or 
direct hypothesis testing in problem 


solving, and together with Gellerman 
(3) and Brown and Buel (2) has noted 
the effectiveness but rare occurrence 
of Ss using a series of identical re- 
sponses as a deliberate tool in event 
observation. 


METHOD


Apparatus.—The apparatus and general pro-
cedure have been described in detail elsewhere
(4) and are the same as described in the previous 
experiment (5). Briefly, S plays a kind of two- 
armed bandit, with a choice on each trial between 
pressing a left key and a right key. The S 
activates the machine by inserting a chip, and 
then presses the key which he thinks will pay 
off. If S selects the correct key, the bandit 
drops a chip into a payoff slot. This is counted
as a win. If no chip comes down, this is counted
as a loss. The S is staked by E to 250 chips.
At the end of the experiment, S is paid the dif-
ference between the number of times he has won
and the number of times he has lost (one cent
for each unit). The bandit is operated by E
from an adjoining room. All Ss are led to
believe, however, that the machine is automatic.
They are given only one piece of information
about the way the bandit pays off. This is that
one key or the other, never neither nor both, will
pay off on each bet.

Design.—For all four groups of Ss, the final 
pattern to be learned is LR (single alternation). 
For three groups, the order of experience is (a) 
learning an initial pattern to criterion (either 
LR, LL, or LLR); (b) 70 trials of “random” 
50:50 payoff, and (c) learning the final pattern 
LR to criterion. For the fourth group (the 
control group), Step (a) is omitted. 

The initial pattern is presented to S until he 
meets the criterion of 15 consecutive errorless 
trials, unless he fails to begin meeting this 
criterion within the limit of 100 trials. All Ss 
move from one phase to the other without any 
time break. All Ss are told that the experiment 
is a study of gambling and no information is 
given about possible patterning. 
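The scoring of pattern learning can be sketched as follows. It is assumed here, since the article does not spell it out, that the score is the number of trials preceding the first run of 15 consecutive errorless trials, and that S counts as a failure if no such run begins within the first 100 trials. The function name and the trial record are hypothetical.

```python
def trials_before_criterion(correct, run_length=15, limit=100):
    """Trials preceding the first run of `run_length` errorless trials.

    correct: list of booleans in trial order.
    Returns None if no criterion run begins within the first `limit` trials.
    """
    streak = 0
    for index, ok in enumerate(correct):
        streak = streak + 1 if ok else 0
        if streak == run_length:
            start = index - run_length + 1  # 0-based index where the run began
            return start if start < limit else None
    return None


# Hypothetical S: 3 errors, 5 correct, 1 error, then 15 correct in a row.
record = [False] * 3 + [True] * 5 + [False] + [True] * 15
initial_score = trials_before_criterion(record)
print(initial_score)
```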

These three initial patterns (LR, LL, and 
LLR) were chosen as a check on the following 
possibilities: (a) Any prior experience of 
regularity will lead to an expectation of order or 
regularity and to closer attention to the en- 
vironmental events. This possibility is an 
extension of Hilgard’s arguments (7). If it 
holds, all groups experiencing initial regularity, 
whatever its form, should learn final LR faster 
than the control group. (b) Prior experience 
with one pattern leads to S’s being alerted for 
recurrence of the same pattern. This possi- 
bility would follow from Lawrence’s study of 
acquired distinctiveness (10) and from our own 
previous experiment (5). In such a case, the 
group starting with LR should show faster final 
learning of LR. (c) Prior experience with a 
pattern leads to faster final learning when the 
prior experience establishes a consistent mode of 
response against which the final LR pattern 
stands out in relief. If this is so, the LL group 
should show faster final learning of LR than does 
the control. 

The argument behind this third possibility 
rests on the special nature of the arrangement of 
trials in the 50:50 period. The event proba- 
bilities add up to 50:50 over the series of 70 
trials but depart from it within each block of 10 
trials. In the first block of 10 trials, the events 
(the alternatives turning out to be correct) are 
in the form 8L:2R, in the second block 4L:6R, 
and in subsequent blocks 6L:4R, 3L:7R, 7L:3R, 
5L:5R, and 2L:8R. We used this arrangement 
because it gives longer runs of events than the 
usual random arrangement within restricted 
blocks, and, because of these longer runs, it is 
especially conducive to maintaining or disrupting 
consistent modes of response established by the 
initial patterning experience. 
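The block arrangement described above can be checked with a few lines of arithmetic. The sketch below records only the left-right counts given in the text; the order of events within each block is not reported and is not reconstructed here.

```python
# (left, right) payoff counts for the seven blocks of 10 trials, as listed.
blocks = [(8, 2), (4, 6), (6, 4), (3, 7), (7, 3), (5, 5), (2, 8)]

total_left = sum(left for left, _ in blocks)
total_right = sum(right for _, right in blocks)

# Running proportion of left payoffs at the end of each block.
running_left = 0
for number, (left, right) in enumerate(blocks, start=1):
    running_left += left
    print(f"after block {number}: {running_left / (10 * number):.2f} left")

print(total_left, total_right)
```

The totals come out even over the 70 trials, while single blocks depart from 50:50 by as much as 8 to 2.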

The group for which a long-run 50:50 ar-
rangement maintains a consistent mode of
response already established is the LL group.
We know from the previous experiment (5) that 
experience with LL followed by long-run 50:50 
gives rise to a strategy of “win-stay, lose-shift” 
throughout the 50:50 phase. The Ss stay with 
a winning key for as long as it pays off and shift
from it only after it has ceased to pay off for one 
or two trials. It is this consistent mode of 
response which, we expect, will facilitate learning 
final LR by giving rise to such a consistent form 
of error that the event pattern will be easily 
grasped. The way in which this can occur is 
illustrated below. 


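Key Paying Off        Response*        Response**

[Entries of this illustration are scrambled in the source; the surviving fragments read L, L, R and L, R, L.]

 * With strategy of win-stay; shift after losing once.
** With strategy of win-stay; shift after losing twice.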


When S shifts after losing once, and the event 
pattern is LR, he is always one step behind the 
machine. When he shifts only after losing twice, 
he is led into a series of identical responses and 
wins on every second trial. Either way, the 
event alternation should be highly discriminable. 
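The two outcomes just described can be reproduced with a short simulation. This is a sketch: the function and its parameters are hypothetical names, the run of 12 alternating trials is arbitrary, and "losing twice" is taken to mean two losses in a row on the same key.

```python
def simulate_win_stay(events, first_choice, losses_before_shift):
    """Play win-stay against a sequence of paying keys ('L' or 'R').

    S stays on a key after a win and shifts to the other key after
    `losses_before_shift` consecutive losses on it.
    Returns (choices, wins) as parallel lists.
    """
    choice, losses = first_choice, 0
    choices, wins = [], []
    for paying in events:
        choices.append(choice)
        won = choice == paying
        wins.append(won)
        if won:
            losses = 0
        else:
            losses += 1
            if losses >= losses_before_shift:
                choice = "R" if choice == "L" else "L"
                losses = 0
    return choices, wins


events = list("LR" * 6)  # single alternation, starting with left
once_choices, once_wins = simulate_win_stay(events, "L", losses_before_shift=1)
twice_choices, twice_wins = simulate_win_stay(events, "L", losses_before_shift=2)

print("paying key:       ", "".join(events))
print("shift after one:  ", "".join(once_choices))
print("shift after two:  ", "".join(twice_choices))
```

After the first trial, the one-loss rule always chooses the key that paid on the preceding trial and so never wins; the two-loss rule never accumulates two losses in a row and therefore stays on one key, winning on alternate trials.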

The LL group is the only group where we would expect the long-run 50:50 arrangement to sustain nicely the strategy established by the initial experience. Whatever the form of response to which experience with LR or LLR predisposes S, it is unlikely to be sustained by an arrangement where there are long runs of payoff on each of the two keys.

Subjects.—The Ss were Harvard undergraduates, almost exclusively freshmen. Each group contained 20 Ss. They received a base pay of 75 cents plus any winnings from the bandit.


RESULTS


Speed of initial learning.—Table 1 shows the distribution of trials required to begin meeting the criterion of 15 consecutive errorless trials. The interesting feature is the increasing difficulty of the pattern learning, from LL to LR to LLR. One S (out of 20) fails to learn initial LR. Five Ss (out of 20) fail to learn LLR. A wide spread of scores with some failures is in line with results reported by Es who used more complex event patterns (13, 16), but we had not expected the failure rate in learning LLR to be so high. The difficulty itself is intriguing, because Ss consistently report that they are looking for a system based on such sequences as “two lefts, three rights,” etc. Their hypotheses are as a rule far more complicated than the patterns to be learned; LLR is the closest in complexity to what they are reportedly searching for and is yet the slowest pattern to be learned; both LL and LR, especially LL, are greeted with a certain dubiety because they seem “so simple,” and yet the least expected, LL, is the fastest learned. The source of difficulty does not appear to lie in the pattern meeting Ss’ expectations but rather in the ease with which an event pattern can be masked by response variability.

One point should be noted about the Ss who are especially slow in learning. One might expect that these Ss would be the ones who had given up and had ceased looking for an event pattern. Such Ss are rare in our sample, although they do occur. The more general rule is that, like Adams’ cats (1), it is the fastest learners who are “uncommonly sluggish.” The slow learners are the Ss who are the most active and participant. They form hypotheses rapidly, usually prematurely, and they test one hypothesis after another. They learn little from the implications of past experience and see only the relevance of a result to the hypothesis being tested at that time. Learning with these Ss occurs only when they “run out of ideas.” As one S put it: “I couldn’t think of any more systems there could be, so I just fed some chips in without anything special in mind, just watching, and there it was.” This is Heidbreder’s (6) primary form of spectator behavior in its most beneficial guise.

TABLE 1

DISTRIBUTION OF TRIALS TO CRITERION IN INITIAL PATTERN LEARNING

Trials to Criterion and % of Ss in Each Group

[Table body largely illegible in the source. Surviving entries: a column headed 20 with the values 100%, 50%, and 20%, and a final column headed 100+ (Failure).]

TABLE 2

MEDIAN TRIALS TO CRITERION OF 15 CONSECUTIVE ERRORLESS TRIALS IN LEARNING INITIAL AND FINAL LR

Order of Events                  N
Initial     Interpolated         (Final Pattern)

LR          50:50                19
LL          50:50                20
LLR         50:50                15
none        50:50                20

[The columns of median trials to criterion (Initial Pattern, Final Pattern) are largely illegible in the source; one entry, 21.0, survives in the first row.]


The rate of failure in learning 
initial LLR calls for caution in in- 
terpreting the scores for final learning 
by members of that group. The 
slow learners may well have been 
screened out by the initial learning 
requirement. 

Speed of final learning.—Median 
trials for final learning of LR are 
shown in Table 2 and the distributions 
of scores in Table 3. The result of 
most importance is the advantage for 
final LR learning given by initial 
experience with either LR or LL, and 
the lack of advantage given by initial 
experience with LLR. 

The LL and LR groups enjoy an 
obvious superiority over the control 
group (no prior experience of pat- 
terning). Median trials to criterion 
are 8.0, 9.5, and 17.0, respectively. 
The Mosteller-Bush variation (14) of 
the Mann-Whitney test yields z 
scores of 3.53 in a comparison of the 
LR group with the control, and 2.07 
in a comparison of the LL group with
the control. A z of 1.96 is regarded
as grounds for rejecting the null
hypothesis.

The medians for the LLR group and the control (12.5 and 17.0, respectively) suggest that some advantage is given by initial experience of LLR, but this is not supported by inspection of the distributions in Table 3 or by the z score of 1.25. It is possible that a much larger sample might show a significant advantage accruing from initial experience with LLR. It is far more likely that the suggestion of advantage after LLR is the result of the slower learners in the group having been screened out by failure to learn the initial pattern.

TABLE 3

DISTRIBUTION OF SCORES FOR LEARNING LR

Trials to Criterion and % of Ss in Each Group

[Table body illegible in the source. Surviving column headings: 11-20, 41-50, 51-90, 91-100, 100+. Rows gave the Initial LR condition and the Final LR condition by group.]

These results enable us to reject 
the first of the three hypotheses dis- 
cussed in connection with the design. 
This is that experience with any form 
of prior patterning will facilitate later 
pattern learning because it sets up a 
general expectation of order or regu- 
larity. This hypothesis does not fit 
the lack of facilitation by initial LLR. 
The advantage given by initial LR experience fits with the hypothesis that prior experience confers distinctiveness, in Lawrence’s sense (10), upon a particular pattern, or that Ss expect a pattern previously experienced to recur in later experience with the same stimulus material. Distinctiveness or expectation of recurrence alone, however, will not cover the advantage given for final LR learning by initial LL experience. For this effect of prior experience, we want to demonstrate that the advantage arises from the consistent mode of response established by the initial LL pattern and sustained by long-run 50:50.

Sequential forms of response during
the 50:50 phase.—As we expected, the
only group showing a highly con- 
sistent mode of response during the 
50:50 phase is the LL group, and its 
mode of response is such that final 
LR stands out clearly against it. 


The method for determining response structuring is as follows. We distinguish among four kinds of preceding trial in terms of the choice made (left or right key) and its outcome of win or loss. We then determine for each group the number of times that each kind of trial is followed by a choice of left and the number of times it is followed by a choice of right. The ratio of left-to-right choices after each kind of trial is compared with the ratio of total left to total right choices. Any significant deviation from this standard is taken as evidence of a “strategy,” i.e., a way of choosing between two alternatives in the light of the previous trial. The extent of the deviation is taken as evidence of the consistency with which the strategy is followed.
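The tabulation described above can be sketched for a single record of choices and payoffs. The function names and the eight-trial record are hypothetical; the article pooled counts over all Ss in a group and tested the deviations with binomial probability paper, a step not reproduced here.

```python
def tabulate_followups(choices, payoffs):
    """Count left and right choices following each of four kinds of trial.

    A kind of trial is (key chosen, 'won' or 'lost').
    choices, payoffs: parallel sequences of 'L' and 'R'.
    """
    kinds = [(key, outcome) for key in "LR" for outcome in ("won", "lost")]
    counts = {kind: {"L": 0, "R": 0} for kind in kinds}
    for i in range(1, len(choices)):
        previous = choices[i - 1]
        outcome = "won" if previous == payoffs[i - 1] else "lost"
        counts[(previous, outcome)][choices[i]] += 1
    return counts


def proportion_left(cell):
    """Proportion of left choices in one cell, or None if the cell is empty."""
    total = cell["L"] + cell["R"]
    return cell["L"] / total if total else None


# Hypothetical record of eight trials for one S.
choices = list("LLLRRLLR")
payoffs = list("LLRRLLLR")
counts = tabulate_followups(choices, payoffs)

# Standard of comparison: proportion of left among all tabulated choices.
overall_left = choices[1:].count("L") / (len(choices) - 1)
for kind, cell in counts.items():
    print(kind, cell, proportion_left(cell))
print("overall proportion left:", round(overall_left, 2))
```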


TABLE 4

INCIDENCE OF LEFT AND RIGHT CHOICES, AND PROPORTION OF LEFT CHOICES, FOLLOWING FOUR KINDS OF PRECEDING TRIALS
(Data for Trials 11-70 of the 50:50 Period)

Event on Preceding Trial and Subsequent Choice

Chose L and Won     Chose L and Lost     Chose R and Won     Chose R and Lost     Total Choices

[Table body largely illegible in the source. Surviving entries, in the order scanned: 50%, 83%, 5[?]%, 0%; 175, 201, 164, 194; 45%; 153; 203**, 26%; 117, 177*; 49%, 19[?].]

 * Significant at 1% level.
** Significant at .1% level.
Significance levels established with binomial probability paper.







The results of this analysis are 
presented in Table 4. The only 
group showing extensive use of a 
strategy is the LL group. The 
strategy used is one of repeating a 
successful choice and the extensive- 
ness of its use is shown by the ratio 
of 5 to 1 in favor of repeating a suc- 
cessful left choice and 3 to 1 in favor 
of repeating a successful right choice. 
The control group (no initial pattern) 
shows a significant but less extensive 
tendency to use the same strategy of 
repeating a successful choice. This 
is a group which is apparently led by 
the experience of long runs in the 
50:50 period to repeat successful 
choices. As in the previous experi- 
ment (5), however, these Ss do not 
persist with a successful choice until 
the run is finished. Instead they 
attempt to switch to the opposite key 
before they have lost and at a point 
where they estimate that the run is 
about to finish. This same tendency


to shift keys before an event change 
occurs is found in the LLR group, 
where Ss develop a strategy of re- 


peating a previously unsuccessful 
choice, but not persisting with it to 
the point where the key starts to pay 
off. 
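The tabulation behind Table 4 can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' procedure: the function names, the data layout (one "L"/"R" choice and one win/loss flag per trial), and the toy sequence are all assumptions, and the significance test by binomial probability paper is not reproduced.

```python
from collections import Counter


def tabulate_strategy(choices, outcomes):
    """Count L and R choices following each kind of preceding trial.

    choices  -- sequence of "L" / "R", one entry per trial (assumed layout)
    outcomes -- sequence of booleans, True when that trial was a win
    Returns a dict keyed by (previous choice, previous win?) whose values
    are Counters of the subsequent choice.
    """
    if len(choices) != len(outcomes):
        raise ValueError("choices and outcomes must have the same length")
    counts = {(key, won): Counter() for key in "LR" for won in (True, False)}
    for i in range(1, len(choices)):
        counts[(choices[i - 1], outcomes[i - 1])][choices[i]] += 1
    return counts


def proportion_left(counter):
    """Proportion of left choices in one cell, or None if the cell is empty."""
    total = counter["L"] + counter["R"]
    return counter["L"] / total if total else None


def baseline_proportion_left(choices):
    """Overall proportion of left choices, the standard each cell is compared with."""
    return sum(1 for c in choices if c == "L") / len(choices)


# Hypothetical eight-trial record, invented purely for illustration.
example_choices = list("LLRRLLRL")
example_outcomes = [True, True, False, True, False, True, True, False]
example_counts = tabulate_strategy(example_choices, example_outcomes)
```

A cell whose proportion of left choices departs markedly from the baseline proportion would, in the terms used above, be read as evidence of a strategy.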

As we expected, the strategy of 
regularly persisting with a previously 
successful choice provides a consistent 
form of response against which the 
final pattern of LRLR stands out 
clearly. Persistence with a pre-
viously successful choice carries with
it some provision for what to do when 
the run is finished and the choice 
leads to a loss. Some subjects adopt 
as their usual policy one of switching 
immediately after one unsuccessful 
choice. When these Ss meet the 
pattern of LRLR with this policy, 
the result is a consistent pattern of 
error. In the words of one S: “I
found I was always one step behind
the machine, so I knew it must be
alternating.” Other Ss adopt as their
usual policy one of persisting with a 
previously successful choice until that 
key has lost twice, and then switching 
to the opposite key. When the pat- 
tern of LRLR is met with this policy, 
the result is a run of left or right 
choices with every second choice 
successful. As one S remarked: “I 
was staying with left until it lost 
twice in a row, and I found I was 
winning every second time, so the 
machine had to be alternating.” 

We have stressed the presence of a 
consistent form of response as a way 
of explaining the speed of final 
learning in the LL group and the 
absence of a consistent form of re- 
sponse as a way of explaining the 
slowness of final learning in the LLR 
group. This interpretation is 
strengthened by the results obtained 
with a group not so far reported. 
This group started with LR and ended 
with LL. Here again we found no 
consistent mode of response during 
the 50:50 period and no facilitation 
of the final learning. 

It should be noted that we do not 
find within groups any relationship 
between degree of response struc- 
turing and speed of final learning. 
This lack of correlation is the result 
of three factors. One is that, beyond 
a certain point, increases in response 
consistency do not seem to give rise 
to any further advantages for final 
learning. The second is that meas- 
ures of response consistency, in order 
to be based on a sufficient sample of 
responses, have to be taken over 
almost the entire series of the 50:50 
phase, but response consistency, in 
order to be effective, may need only 
to occur toward the end of the 50:50 
phase and in meeting the final LR 
pattern. The third factor is that 
occasionally response consistency represents
a withdrawal from the problem
rather than a tool for event observation;
S has abandoned belief in the
possibility of finding a regular event
pattern or of improving his performance,
and settles for a form of response
which enables him “to do well
enough.” This source of response
consistency is rare among our Ss, but
its occasional occurrence, together
with the other two factors, limits
the possibility of a correlation within
groups between degrees of response
consistency and speed of final
learning.*


Discussion 


We are led to suggest two extensions
to the concept of discriminability proposed
by Longenecker, Krauskopf, and
Bitterman (11) to cover the differences
they found in speed of response to
changes in event patterning. These
extensions take the form of suggesting
two ways by which the discriminability
of event patterns can be enhanced or
obscured.

When the initial and final patterns are
the same, the initial experience may lend
a useful distinctiveness, in Lawrence’s
sense (10), to the initial pattern, a
distinctiveness buttressed by the fact
that Ss do expect patterns previously
found important to recur in later
experience with the same stimulus material.
It may even be that prior experience
confers distinctiveness not only upon one
particular pattern but also upon similar
patterns, and that the extent to which
later learning is facilitated will be a
function of the degree of similarity
between initial and final patterns. This
latter relationship is not likely to be a
simple one, however, since Ss expect
change to proceed not only along a
dimension of similarity (e.g., all payoffs
on one side or the other, or all in some
form of alternation), but also in the
direction of increasing rather than
decreasing complexity.

* To test for such correlation, we used
Hyman’s (8) procedures, allowing measures of
extent of contingency between response on one
trial and (a) the response on the previous trial,
(b) the alternative correct on the previous trial,
and (c) the combination of response and outcome
on the previous trial.

It is when the initial and final patterns
are not the same that the second source
of discriminability becomes important.
This source is the consistency of the mode
of response established by prior experience.
Response consistency provides a
standard set of operations for S to follow
and a steady background against which
the event pattern can stand out in relief.

These are certainly not the only two
sources by which an event pattern can
be made more discriminable. Any learning
set or any experience which leads S
to become event oriented rather than
response-hypothesis oriented should
serve the same function. Instructions
such as “don’t be blind” (12), training
in suspending judgment until all the
data are in (18), experience which lowers
one’s level of anxiety and concern with
questions of success and failure (9):
all these have the function of directing
S's attention toward the events rather
than toward his own responses and
hypotheses. We have chosen to emphasize
the role of response consistency
partly because of its intrinsic significance
and partly as a counterweight to the
tendency to regard response consistency
solely as stereotypy and therefore somehow
disfunctional (13). Keeping one’s
own responses constant is as important
in experimental pattern learning as using
standard measures is in scientific problem
solving.


SUMMARY


Each of three groups of Ss starts by learning
that there is a consistent pattern of payoff on a
two-choice gambling device: either LR, LL
(all left), or LLR. Each S is then shifted to a
random schedule of payoff for 70 trials, and
finally a pattern of LR is introduced. A control
group experiences no initial pattern but starts
with the random schedule. Experience with
initial LR or with LL, but not with LLR, leads
to faster final learning of LR.
The results are explained in terms of previous
experience making the final pattern more
discriminable, with discriminability strongly
influenced by Ss learning to adopt a consistent
mode of response against which an event pattern
stands out clearly rather than being masked by
response variability.


REFERENCES


1. Adams, D. K. Experimental studies of
adaptive behavior in cats. Comp. Psychol.
Monogr., 1929, 6, No. 1 (Whole No. 27).

2. Brown, W., & Buel, J. Response tendencies
and maze patterns as determiners
of choice in a maze. J. comp. Psychol.,
1940, 29, 337-399.

3. Gellerman, L. W. The double alternation
problem: II. The behavior of children
and human adults in a double alternation
temporal maze. J. genet. Psychol., 1931,
39, 359-392.

4. Goodnow, J. J. Determinants of choice in
two-choice situations. Amer. J. Psychol.,
1955, 69, 106-116.

5. Goodnow, J. J., & Pettigrew, T. F. Effect
of prior patterns of experience upon
strategies and learning sets. J. exp.
Psychol., 1955, 49, 381-389.

6. Heidbreder, E. An experimental study of
thinking. Arch. Psychol., 1924, 11, No.
5 (Whole No. 73).

7. Hilgard, E. R. The role of learning in
perception. In R. R. Blake & G. V.
Ramsey (Eds.), Perception—an approach
to personality. New York: Ronald,
1951. Pp. 95-120.

8. Hyman, R., Potter, M. C., & Dinwoodie,
W. D. Personality correlates of behavioral
stereotypy. J. abnorm. soc.
Psychol., in press.

9. Keister, M. E. The behavior of young
children in failure. In R. G. Barker,
J. S. Kounin, & H. F. Wright (Eds.),
Child behavior and development. New
York: McGraw-Hill, 1943. Pp. 429-440.

10. Lawrence, D. H. Acquired distinctiveness
of cues. I. Transfer between discriminations
on the basis of familiarity with
the stimulus. J. exp. Psychol., 1949, 39,
770-784.

11. Longenecker, B. D., Krauskopf, J., &
Bitterman, M. E. Extinction following
alternating and random partial reinforcement.
Amer. J. Psychol., 1952, 65, 580-587.

12. Luchins, A. S. Mechanization in problem
solving: the effect of Einstellung. Psychol.
Monogr., 1942, 54, No. 6 (Whole No. 248).

13. Marquart, D. I. The pattern of punishment
and its relation to abnormal fixation
in adult human subjects. J. genet.
Psychol., 1948, 39, 107-144.

14. Mosteller, F., & Bush, R. R. Selected
quantitative techniques. In G. Lindzey
(Ed.), Handbook of social psychology.
Cambridge, Mass.: Addison-Wesley,
1954. Pp. 289-334.

15. Mosteller, F., & Tukey, J. W. The uses
and usefulness of binomial probability
paper. J. Amer. Stat. Ass., 1949, 44,
174-212.

16. Prentice, W. C. H. Continuity in human
learning. J. exp. Psychol., 1949, 39,
187-194.

17. Thorndike, E. L. Animal intelligence.
New York: Macmillan, 1911.

18. Thorndike, E. L. How children learn the
principles and techniques of problem
solving. Chicago: Natl. Soc. Stud.
Educ., 49th Yearb., Part I. Pp. 196-216.


(Received June 7, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


EFFECTS OF STIMULUS SIMILARITY ON
DISCRIMINATION LEARNING


CHARLES C. SPIKER 1
Iowa Child Welfare Research Station


Increase in the difficulty of a dis- 
crimination task, produced by increase 
in the similarity of the discriminal 
stimuli, has been explained in terms of 
increased generalization of incorrect 
approach and inhibitory tendencies 
(1, 3, 4). Observations of children in
discrimination problems have led this
investigator to consider the possibility
that increase in stimulus similarity
increases task difficulty through the
introduction of factors in addition to
stimulus generalization. The present
experiment is designed to test for the
presence of such additional factors
and to attempt to provide a measure
of stimulus generalization that is not
contaminated by them.


A successive discrimination task in 
which one or more of the available 
responses is always incorrect offers a 
means of isolating task difficulty con- 
tributed by stimulus generalization 
and that contributed by other factors. 
Increase in stimulus generalization 
would be expected to increase gen- 
eralized (intralist) errors, but it 
would not be expected to increase the 
frequency of occurrence of responses 
that are never correct (extralist 
errors). On the other hand, if in- 
crease in stimulus similarity increases 
the potency of inattention, disinterest, 
frustration due to failure, etc., one 
would expect an increase in the fre- 
quency of both intralist and extralist 
errors. 

1 The writer is indebted to Dr. Irma R.
Gerjuoy who participated in the preliminary
experiments upon which this experiment is based.
He also wishes to thank his research assistant,
Miss Ruth B. Holton, for her aid in collecting
the data.


METHOD


Apparatus.—The apparatus was a black box,
18 × 24 × 16 in. Six inches above a 3-button
response panel was a 3-in. diameter aperture of
flashed opal glass. White geometric figures
(square, circle, or triangle) were painted around
each of the buttons, each of which was approximately
7 in. from the other two. Two 75-w
projection lamps were mounted 6 in. behind the
aperture, each enclosed in a light-proof reflecting
container on the front of which a 2 × 2-in.
glass color filter could be inserted. The E
could turn on either one of the lamps and
predetermine which of the three buttons would
break its circuit. For each S, one button did
not turn out either lamp. If S pushed the
correct button, the lamp was turned out and a
marble automatically delivered to S. Incorrect
responses resulted in the disconnection of all
buttons, thereby preventing more than one
response per trial.

Experimental design.—Three experimental
groups differed only in the discriminal stimuli
presented. The stimuli were obtained by
diffusing light through glass color filters (Corning
Glass Works) onto the aperture. Group HS
was presented with a yellow-green (No. 4015)
and a green (No. 4308 superimposed on No.
3484). Group S was presented with the yellow-green
and a blue-green (No. 3385 superimposed
on No. 4303). Group D received the yellow-green
and a deep blue (No. 5330). The stimuli
in each pair are readily discriminated by adults
with normal color vision. Within these groups,
the position of the disconnected button, which
was never correct, was controlled by randomly
assigning Ss to three subgroups.

Subjects.—Sixty-five preschool Ss from the
Iowa Child Welfare Research Station laboratories
were randomly assigned to the experimental
groups: 25 to Group HS, 22 to Group S,
and 18 to Group D. The Ss ranged in age from
4 yr., 2 mo. to 5 yr., 8 mo. Some Ss had previous
experience with simultaneous and successive
discrimination learning problems, but none had
previous experience with the present apparatus.

Procedure.—A familiar E brought S to the
experimental room and allowed him to select one
of a number of dime-store toys which he would
attempt to win. The E then demonstrated the
two stimulus lights one at a time, instructing S
that marbles could be obtained by turning out
the lights, and that the marbles could be used
to buy the selected toy. The appropriate
stimuli were presented in a prearranged order
that was the same for all Ss. Each light appeared
three times in successive blocks of six
presentations, long series of alternations were
avoided, and neither light appeared more than
three times in succession. Button (position)
preferences were verbally discouraged by E.
All Ss received at least 30 stimulus presentations
(trials); Ss who failed to attain 9 correct choices
in 10 consecutive trials by Trial 30 were given
additional trials until this criterion was attained
or until 96 trials had been given. Eleven of the
65 Ss failed to reach this criterion in 96 trials.
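The learning criterion described in the procedure (9 correct choices in 10 consecutive trials, with a ceiling of 96 trials) can be expressed as a short sliding-window check. The sketch below is illustrative only; the function name, the boolean trial record, and the example sequence are assumptions rather than anything reported in the article.

```python
def trial_of_criterion(correct, window=10, needed=9):
    """Return the 1-based trial on which the criterion is first met.

    correct -- sequence of booleans, True for a correct choice on that trial
    window  -- number of consecutive trials inspected (10 in the procedure)
    needed  -- correct choices required within the window (9 in the procedure)
    Returns None if the criterion is never met in the record supplied.
    """
    running = 0  # number of correct choices inside the current window
    for i, was_correct in enumerate(correct):
        running += bool(was_correct)
        if i >= window:
            # Drop the trial that has just slid out of the window.
            running -= bool(correct[i - window])
        if i + 1 >= window and running >= needed:
            return i + 1
    return None


# Hypothetical record: five errors followed by nine correct choices.
example_record = [False] * 5 + [True] * 9
example_trial = trial_of_criterion(example_record)
```

An S whose 96-trial record returns None under this check would correspond to one of the Ss who failed to reach criterion.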


RESULTS


Analysis of the total error scores 
for the subgroups, which differed only 
in terms of the position of the dis- 
connected button, revealed no con- 
sistent or significant differences among 
them. Accordingly, the data for 
these subgroups have been combined 
within each experimental group. The 
scores used in subsequent analyses are 
the number of errors involving the 
button that was connected (intralist 
errors) and the number of errors 
involving the button that was dis- 
connected (extralist errors). 

Table 1 shows the means of the two
types of error for each of the three 
experimental groups. From inspec- 
tion, it is apparent that both error 
types tend to increase in frequency 
with increase in stimulus similarity, 
and that the magnitude of the differ- 
ence between the means of the two 
error types also increases with increas- 


TABLE 1

Means of the Two Types of Error

Groups   | N  | Intralist Errors | Extralist Errors | Total Errors
HS       | 25 | 21.48            | 7.44             | 28.92
S        | 22 | 17.64            | 6.95             | 24.59
D        | 18 | 6.44             | 3.28             | 9.72
Combined | 65 | 16.02            | 6.12             |




TABLE 2

Analysis of Variance of Number
of Errors to Criterion

Source | df | Mean Square
Between-Ss:
  Stimuli (S)
  Error (b)
Within-Ss:
  Error Type (E)
  E × S
  Error (w)
Total

[Table entries largely illegible; surviving values: 1014.28, 6.09]

ing stimulus similarity. For each 
group, the mean frequency of intralist 
errors is greater than that of extralist 
errors. 
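As a quick arithmetic check on Table 1, the combined means can be recovered as N-weighted averages of the group means. The sketch below assumes the group sizes given under Subjects (25, 22, and 18) and the tabled means; the variable and function names are illustrative only.

```python
# Group sizes and mean errors as reported in the text and Table 1.
groups = {
    "HS": {"n": 25, "intralist": 21.48, "extralist": 7.44},
    "S": {"n": 22, "intralist": 17.64, "extralist": 6.95},
    "D": {"n": 18, "intralist": 6.44, "extralist": 3.28},
}


def weighted_mean(error_type):
    """N-weighted mean of one error type across the three groups."""
    total_n = sum(g["n"] for g in groups.values())
    weighted_sum = sum(g["n"] * g[error_type] for g in groups.values())
    return weighted_sum / total_n


combined_intralist = weighted_mean("intralist")
combined_extralist = weighted_mean("extralist")

# Difference between the two error types within each group; the widening
# gap from D to HS is what the interaction term reflects.
gaps = {name: g["intralist"] - g["extralist"] for name, g in groups.items()}
```

Rounded to two places, the weighted means agree with the Combined row of the table (16.02 and 6.12).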

The statistical significance of these 
trends was tested by a Lindquist 
Type I analysis (2) and by further 
tests of the simple effects. The 
results of the Type I analysis are 
presented in Table 2. Here it is 
apparent that the experimental groups 
differ significantly in terms of mean 
total errors, and that frequency of 
intralist errors is significantly greater 
than the frequency of extralist errors 
for the combined experimental groups. 
Furthermore, there is an interaction 
(P < .05) between the stimulus
groups and the type of error. The 
interaction indicates that, with in- 
creasing stimulus similarity, there is 
an increase in the magnitude of the 
difference between the mean fre- 
quencies of the two types of error. 

The data were analyzed as two 
simple randomized designs to deter- 
mine the stimulus effects on the two 
types of error separately. For the 
intralist errors, the stimulus effects 
were significant at the 1% level 
(F = 5.54; df = 2, 62). For extralist
errors, the stimulus effects were significant
at the 2.5% level (F = 4.25;
df = 2, 62). Using the t test, comparisons
were made of pairs of groups.
For the intralist errors, the mean of
Group D differs significantly from
that of Group HS (P < .01) and from
that of Group S (P < .02). The
means for Groups HS and S do not
differ significantly. For the extralist
errors, also, the Group D mean is
significantly lower than the Group
HS mean (P < .01) and the Group
S mean (P < .02), but the means for
Groups HS and S are not significantly
different. Clearly, an increase in
stimulus similarity resulted in an
increase of errors of both types.
Finally, t tests were made of the
significance of the differences between
means for intralist and extralist errors
for the three experimental groups
separately. Significantly more intralist
errors than extralist errors were
made by Groups HS and S (P < .001),
but the difference for Group D was not
statistically significant (P > .05).


groups 
more in- 


DISCUSSION


The greater frequency of intralist than
of extralist errors presumably reflects the
operation of stimulus generalization,
which, in the present experiment, is
expected to increase intralist errors
without directly increasing the number
of extralist errors. The interaction between
stimulus similarity and type of
error is interpreted as the result of an
increase in stimulus generalization with
increasing stimulus similarity. This interaction
is somewhat more statistically
reliable (P < .01) when only the extreme
groups are considered. It would seem
that the interaction provides an index of
the amount of generalization that is not
affected by other factors that may be
associated with increased stimulus similarity.

The increase in frequency of extralist
errors with increasing stimulus similarity
is only indirectly due to increased
stimulus generalization. A task made
difficult by high generalization is made
even more difficult through loss of
interest due to increased number of
failures; elicitation of inappropriate responses
(orienting away from stimuli,
asking to leave the situation, hitting
buttons indiscriminately, etc.); and the
development of fatigue due to longer
experimental sessions. Qualitative observations
suggest that such factors were
more frequent among Ss given highly
similar stimuli than among those given
distinctive stimuli.

The evidence indicates that errors on
a discrimination learning task may provide
an overestimate of the direct effects
of stimulus generalization. Children’s
performance, at least, appears to be
affected by stimulus similarity in a
manner that can only partly be attributed
directly to stimulus generalization.


SUMMARY


Three groups of preschool children were given
a successive discrimination learning task in
which generalized (intralist) or nongeneralized
(extralist) errors could be made. The conditions
for the three groups differed only in the
similarity of the discriminal stimuli. The results
indicated that increased stimulus similarity
resulted in greater numbers of both types of
error and an increased difference between the
means of the two types of error. These results
were interpreted as reflecting increased stimulus
generalization as a function of increase in stimulus
similarity. The increase in extralist errors with
increased stimulus similarity was interpreted
as reflecting the operation of factors in addition
to stimulus generalization, e.g., decrease in
interest, motivation, or attention, and increase
in fatigue.


REFERENCES


1. Hull, C. L. A behavior system. New
Haven: Yale Univ. Press, 1952.

2. Lindquist, E. F. Design and analysis of
experiments in psychology and education.
Boston: Houghton Mifflin, 1953.

3. Spence, K. W. The nature of discrimination
learning in animals. Psychol. Rev., 1936,
43, 427-449.

4. Spence, K. W. The differential response
in animals to stimuli varying within a
single dimension. Psychol. Rev., 1937,
44, 430-444.


(Received June 15, 1955) 








Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


CONSTANT-SUM JUDGMENTS OF FACIAL
EXPRESSIONS 1


TRYGG ENGEN AND NISSIM LEVY 


Brown University 


The main purpose of this study was 
to determine the potentiality of the 
constant-sum method in psychometric 
scaling. An answer was sought to 
the question as to whether or not the 
constant-sum method could be suc- 
cessfully applied to scaling of pictures 
of facial expression. The method 
deserves attention because it may 
yield ratio scales, but it has thus far 
been used only in traditional psycho- 
physical experiments, i.e., judgments 
of lines and weights have been com- 
pared to physical scales (1, 2, 3, 4). 

A second aim was to compare scales 
of facial expressions developed by the 
method of rating and the constant- 
sum method. For this purpose, there 
are available the ratings provided by 
Schlosberg (8, 9, 10) on the dimen-
sions of pleasantness-unpleasantness
(P-U), attention-rejection (A-R), and 
sleep-tension (S-T). 


METHOD


Three group experiments were undertaken, 
and each employed the same experimental pro-
cedure. Fifteen Frois-Wittmann pictures were
selected to sample eight values on each of 
Schlosberg’s (10) three dimensions. Pictures 
58, 41, 2, 26, 43, 51, 31, and 70 from the Frois- 
Wittmann series (5) were chosen to represent the 
P-U dimension, Pictures 41, 21, 51, 12, 43, 69, 
26, and 28 the A-R dimension, and Pictures 52, 
51, 67, 12, 70, 58, 36, and 34 for the S-T di- 
mension. Some of the pictures represent more 
than one dimension, i.e., Pictures 41, 26, 43,
and 51 occur in the samples of both P-U and
A-R, and Pictures 58, 51, and 70 represent both
P-U and S-T, and Picture 51 appears in all three
dimensions. 

1 This paper was presented at the 1955
meetings of the Eastern Psychological Association
in Philadelphia.

Photographic slides were made of the 28
possible pairs of the 8 pictures representing each
dimension, and these were projected on a screen
for group presentation. The slides were presented
in a random order with the one stipulation
that the same picture could not appear in two
successive slides. There were two presentations
of each series of 28 slides, and these were
counterbalanced for series and right-left position effects.
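The slide series described above (all 28 pairs of 8 pictures, in random order, with no picture appearing on two successive slides) can be generated mechanically. The following is only a sketch of one way to do it, a randomized greedy search with restarts; the function name and the search method are assumptions, not the authors' procedure. The picture numbers are those listed for the P-U dimension.

```python
import itertools
import random


def order_pairs(pictures, rng, max_attempts=10_000):
    """Random order of all picture pairs with no picture on successive slides.

    pictures -- iterable of picture identifiers (8 per dimension in the study)
    rng      -- a random.Random instance, so that results are reproducible
    """
    pairs = list(itertools.combinations(pictures, 2))
    for _ in range(max_attempts):
        remaining = pairs[:]
        rng.shuffle(remaining)
        order = [remaining.pop()]
        while remaining:
            previous = set(order[-1])
            # Only pairs sharing no picture with the previous slide qualify.
            candidates = [p for p in remaining if previous.isdisjoint(p)]
            if not candidates:
                break  # dead end; restart with a fresh shuffle
            chosen = rng.choice(candidates)
            remaining.remove(chosen)
            order.append(chosen)
        else:
            return order  # every pair was placed
    raise RuntimeError("no valid order found within max_attempts")


# Picture numbers chosen for the P-U dimension.
p_u_pictures = [58, 41, 2, 26, 43, 51, 31, 70]
slide_order = order_pairs(p_u_pictures, random.Random(0))
```

Right-left position within each slide and the counterbalancing of the two presentations are not handled here and would need a further step.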

The same students in an introductory class in 
psychology served as Ss for the three experi- 
ments, which were one week apart. The N, in 
order of presentation of the dimensions, was 42, 
37, and 39 for the P-U, A-R, and S-T dimen- 
sions, respectively. The Ss recorded their 
responses on prepared data sheets. They were 
given approximately 30 sec. for each judgment, 
which was ample time as determined by pre- 
liminary trials on other Ss. It should be noted 
that Ss complained about the difficulty of the 
task, although each dimension was carefully 
defined. The instructions emphasized that 100 
points were to be divided between the members 
of each pair of pictures in terms of the relative 
amount of pleasantness (or attention or tension) 
expressed in the faces. The method was 
demonstrated with vertical lines drawn on the 
blackboard and with pictures from the Ruckmick 
(7) series representing two relative extremes of 
each dimension. The Ss were discouraged from
using the 100-0 division for any of the pairs on
the basis that these pictures probably did not
represent the most extreme emotional expressions
they could imagine. They were encouraged
to use any other division of 100 they
thought appropriate and not to restrict themselves
to five or ten point divisions.


RESULTS AND DISCUSSION


Scale values were developed from 
the Ss’ point assignments by formulae 
presented by Comrey (2), and the 
results of these computations are 
presented in Table 1 along with the 
Schlosberg rating values for the same 
pictures. As far as ordinal charac- 
teristics are concerned, the results of 
the present three experiments are in 
almost complete agreement with those 
of Schlosberg. For example, it should 















TABLE 1

Constant-Sum Method (CSM) and Rating Method (RM) Scale
Values on Dimensions of P-U, A-R, and S-T
(Picture numbers are Frois-Wittmann’s (5))

[Table body illegible: for each of the P-U, A-R, and S-T dimensions,
columns give Picture Number, CSM, and RM.]

Note. The S-T data, which have not been published previously,
were kindly supplied by Prof. H. Schlosberg.
They had been obtained by the same rating method used for the other two dimensions with an N of 20.


be noted that some of the pictures 
(i.e., 26, 43, 51, 58, and 70), which 
had markedly different scale values 
on Schlosberg’s three dimensions, 
likewise obtained different scale values 
on the three constant-sum scales. 
The tau rank-order correlations be- 
tween the scales developed with the 
two methods are .96 for P-U, .92 for
A-R, and .92 for S-T.

Only one large inconsistency was 
observed. This was Picture 41 on 
the A-R scale. However, the con- 
stant-sum method scale value is 
consistent with the Woodworth scale 
position given by Schlosberg (8) in 
1941. This picture may well have 
been misplaced in Schlosberg’s (9) 
1952 study either through error or 
through some obscure series effects. 

The present results demonstrate S’s
ability to describe facial expressions
with numbers. The direct estimates
of the ratio between adjacent stimulus
pairs (e.g., A/B) are in good agreement
with the indirect estimates
(e.g., A/B = A/C ÷ B/C, etc.).
The means of the differences between
the six indirect and the one direct
estimates were .05, .15, and .09 for
slightly positively skewed distributions
with SD’s of .21, .33, and .25
for A-R, S-T, and P-U, respectively.
In the case of the most variable dimension,
this means that if the direct
estimate of A/B was 2/1, which is
reflected in a 67-33 point assignment,
about 68% of the indirect estimates
were between 64-36 and 70-30. The
small variability is surprising in view
of Ss’ complaints about the difficulty
of the task. This internal consistency
compares well with Comrey’s (2)
results. Our reanalysis of his data
obtained with the constant-sum
method on judgments of lines yields
a mean of .05 and an SD of .15, i.e.,
with a direct estimate of 67-33, 68%
of the indirect point assignments were
between 65-35 and 68-32.

The good general agreement between
the scales developed by the two
methods and the internal consistency
of the constant-sum scales render it
feasible to answer affirmatively the
question which led to these experiments.
That is to say, it has been
demonstrated that the constant-sum
method can be applied with success
to psychometric dimensions. Finally,
the high agreement between the
constant-sum judgments and the
ratings lends evidence to the stability
of Schlosberg’s dimensions.
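The relation between a constant-sum point split and the ratio it implies, and between direct and indirect ratio estimates, can be illustrated with a few lines of arithmetic. This is a sketch under simple assumptions (a split of a/b points is read as the ratio a/b, and an indirect estimate of A/B is formed through a third stimulus C); the function names are illustrative, and Comrey's full scaling formulae are not reproduced.

```python
def points_to_ratio(a_points, b_points):
    """Ratio A/B implied by a constant-sum split, e.g. 60-40 -> 1.5."""
    return a_points / b_points


def ratio_to_points(ratio, total=100.0):
    """Point split (A, B) of `total` points that corresponds to ratio A/B."""
    a_points = total * ratio / (1.0 + ratio)
    return a_points, total - a_points


def indirect_ratio(a_over_c, b_over_c):
    """Indirect estimate of A/B obtained through a third stimulus C."""
    return a_over_c / b_over_c


# A direct estimate of 2/1 corresponds to roughly a 67-33 split.
direct_split = ratio_to_points(2.0)

# Hypothetical judgments against a third picture C: A/C = 3.0, B/C = 1.5.
indirect_estimate = indirect_ratio(3.0, 1.5)
```

With eight pictures per dimension, each adjacent pair has six other pictures that can serve as C, which gives the six indirect estimates compared with the one direct estimate.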


SUMMARY 


Fifteen of the Frois-Wittmann pictures of 
facial expressions were scaled by the constant- 
sum method and compared with Schlosberg’s 
rating scales of the same pictures on his dimen-
sions of pleasantness-unpleasantness, attention-
rejection, and sleep-tension. The constant-sum
scales were in good agreement with the rating
scales as far as ordinal characteristics were
concerned. Further, the constant-sum scales
showed high internal consistency in terms of 
interratio agreements, although not quite as 
high as has been found with this method in 
psychophysical scaling of lines. The results 
provide evidence that the constant-sum method, 
which may yield ratio scales, can be extended to 
material more complex than lines and weights. 


REFERENCES

1. Baker, K. E., & Dudek, F. J. Weight
scales from ratio judgments. J. exp.
Psychol., 1955, 50, 293-308.

2. Comrey, A. L. A proposed method for
absolute ratio scaling. Psychometrika,
1950, 15, 317-325.

3. Engen, T. An evaluation of a method for
developing ratio scales. Amer. J. Psychol.,
in press.

4. Guilford, J. P., & Dingman, H. F. A
validation study of ratio-judgment methods.
Amer. J. Psychol., 1954, 67, 395-410.

5. Hulin, W. S., & Katz, D. The Frois-Wittmann
pictures of facial expression.
J. exp. Psychol., 1935, 18, 482-498.

6. Metfessel, M. A proposal for quantitative
reporting of comparative judgments.
J. Psychol., 1947, 24, 229-235.

7. Ruckmick, C. A. A preliminary study of
the emotions. Psychol. Monogr., 1921,
30, No. 3 (Whole No. 136). Pp. 10-35.

8. Schlosberg, H. A scale for judgment of
facial expressions. J. exp. Psychol., 1941,
29, 497-510.

9. Schlosberg, H. The description of facial
expressions in terms of two dimensions.
J. exp. Psychol., 1952, 44, 229-237.

10. Schlosberg, H. Three dimensions of
emotion. Psychol. Rev., 1954, 61, 81-88.


(Received June 27, 1955) 







Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


PERCEPTUAL BEHAVIOR AS RELATED TO FACTORS 
OF ASSOCIATIVE AND DRIVE STRENGTH 


JEAN ENGLER 1 AND JAMES T. FREEMAN


Iowa State College


Much of current behavior theory
regards behavior as being determined
by at least two general classes of
variables. These are usually considered
as associative or learning
variables and drive variables. The
most explicit statement of the functional
properties of these variables has
been expressed by Hull (6) who considers
performance or response (R) to
be a function of the interaction of
habit strength (H) and drive (D). A
number of studies have been carried
out within the above framework in
which anxiety has been manipulated
as a drive variable.


Anxious Ss have been reported to condition 
more readily and to extinguish less readily on 
both the eyelid reflex (14, 15, 16) and the gal- 
vanic skin reflex (4, 20) than nonanxious Ss. 

Farber and Spence (2) reported that the maze 
performance of anxious Ss was significantly 
poorer than that of nonanxious Ss with the 
greatest difference at the most difficult points. 
Taylor and Spence (18) noted that the per- 
formance of anxious Ss was inferior to that of 
nonanxious Ss in a learning situation involving 
the presence of competing response tendencies 
with the greatest difference for the more difficult 
task. 

Montague (8) reported that anxiety increases
the difference between stronger and weaker
response tendencies in a learning situation,
aiding performance if correct tendencies are
dominant. Ramond (12) reported that in two-choice
response type of paired-associate learning
nonanxious Ss responded correctly significantly
more often than did anxious Ss on those presentations
for which the weaker of two possible
responses was correct. There was no significant
difference between performance of the anxious
and nonanxious Ss on those presentations for
which the stronger response was correct.


Recent trends in perceptual theory have indicated that much of perceptual behavior may be approached from the standpoint of learned response dispositions whose properties may be described in terms of general principles of associative learning. It should be noted that such formulations do not necessarily preclude the existence of motivational determinants. There would thus seem to be an apparent parallel between, say, Hullian behavior formulations on the one hand and those recently expressed by Postman (9), for example.

¹ Now at Northwestern University.

In a perceptual situation, the strength of response dispositions may be varied in two ways (among others): (a) by varying the frequency characteristics of the stimulus materials (5, 7, 13, 21), and (b) by manipulating the set of the Ss (3, 10, 11). The above studies, taken together, indicate that the higher the frequency of the stimulus words and the more restrictive the set of S, the easier will be the perceptual task (i.e., the greater the response disposition, the less the stimulus information needed, and hence the lower the recognition threshold). Past studies also indicate that in learning situations high-anxiety Ss are superior to low-anxiety Ss in performance on simple tasks while low-anxiety Ss are superior to high-anxiety Ss in performance on more complex tasks.

The present research is concerned with investigating perceptual recognition thresholds as a function of associative strength (or strength of the response disposition) as determined by word frequency and set, and as a function of drive (motivation) as determined by anxiety.
In line with the foregoing consider- 
ations, it is hypothesized that high- 
anxiety Ss will have lower recognition 
thresholds than low-anxiety Ss to 
high-frequency words under the set 
condition, i.e., these Ss should perform 
better in a simpler perceptual task. 
It is further hypothesized that the 
low-anxiety Ss will have lower recog- 
nition thresholds than high-anxiety 
Ss to low-frequency words under the 
no-set condition, i.e., these Ss should 
perform better in a more complex 
perceptual task. 


METHOD


Subjects.—The Ss were 80 undergraduate
students in introductory psychology classes at
Iowa State College.

Apparatus.—The apparatus was a Harvard-
type tachistoscope calibrated in .01 sec. The
size of the exposure field was 10 X 10 in. with 
the stimulus words (typed in elite capitals) 
exposed through an aperture 1 X 3 in. centered 
in the exposure field. The intensity of illumi- 
nation of the exposure field measured at the 
aperture was 6.0 ft.-candles. The distance of 
the exposure aperture from S’s eye was 28 in. 

The size of the pre-exposure field was 10 X 10
in. with the fixation point (a cross 4 X § in.) 
centered in the pre-exposure field. The in- 
tensity of illumination of the pre-exposure field 
at the fixation point was 5.0 ft.-candles. The 
distance of the pre-exposure fixation point from 
S's eye was 22 in. 

All illumination intensity measurements were 
obtained directly from a Weston Illumination 
Meter, Model 603. 

Experimental design.—Matched groups con- 
sisting of five high-anxiety and five low-anxiety 
females and five high-anxiety and five low- 
anxiety males served in each of four conditions. 
The conditions involved combinations of high- 
and low-frequency stimuli and set and no-set 
instructions.
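The design described above can be laid out as a small enumeration. The following is only an illustrative sketch: the variable names are hypothetical, and it assumes that the four conditions are the four combinations of stimulus frequency and set instructions, with five Ss in each anxiety-by-sex cell of every condition.

```python
from itertools import product

# Factor levels as named in the text; the identifiers themselves are illustrative.
FREQUENCY = ("high", "low")
SET = ("set", "no-set")
ANXIETY = ("high", "low")
SEX = ("female", "male")
SS_PER_CELL = 5  # five Ss per anxiety-by-sex cell within each condition


def design_cells():
    """Return one record per cell of the frequency x set x anxiety x sex layout."""
    return [
        {"frequency": f, "set": s, "anxiety": a, "sex": x, "n": SS_PER_CELL}
        for f, s, a, x in product(FREQUENCY, SET, ANXIETY, SEX)
    ]


cells = design_cells()
total_ss = sum(cell["n"] for cell in cells)

# Number of Ss serving in each of the four frequency-by-set conditions.
per_condition = {}
for cell in cells:
    key = (cell["frequency"], cell["set"])
    per_condition[key] = per_condition.get(key, 0) + cell["n"]
```

Under these assumptions the layout has 16 cells, 20 Ss per condition, and 80 Ss in all, which matches the number of Ss reported.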

Procedure for selecting Ss.—The MMPI A
scale as developed and revised by Taylor (17) 
was administered to 200 male and 200 female 
students. The distributions of scores for males, 
for females, and for the total sample were plotted 
and compared to the distribution obtained by 
Taylor. All three distributions agreed in shape 
with Taylor’s distribution, being positively 
skewed. The total-sample distribution had a
significantly higher median than that reported
by Taylor (P = .03) and the median and both 
quartiles were significantly higher for males than 
for females (P = .02). 

The 20 males and 20 females with the highest 
A scores (24 to 40) and the 20 males and 20 
females with the lowest A scores (2 to 10) were 
used as Ss. 
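The extreme-groups selection just described can be sketched as follows. This is a minimal illustration, not the authors' procedure in detail: the function name, the S identifiers, and the scores are hypothetical, and ties at the cutoff are broken arbitrarily by sort order, a point the text does not address.

```python
def extreme_groups(scores, n_per_group=20):
    """Split one sex's A-scale scores into the highest and lowest scorers.

    scores: dict mapping a (hypothetical) S identifier to an A-scale score.
    Returns (high_ids, low_ids), each of length n_per_group.
    """
    if len(scores) < 2 * n_per_group:
        raise ValueError("not enough Ss to form two non-overlapping groups")
    ranked = sorted(scores, key=lambda sid: scores[sid])  # ascending by score
    low_ids = ranked[:n_per_group]
    high_ids = ranked[-n_per_group:]
    return high_ids, low_ids


# Made-up scores for 200 Ss of one sex, for illustration only.
example_scores = {f"S{i:03d}": (i * 7) % 41 for i in range(200)}
high_ids, low_ids = extreme_groups(example_scores)
```

Running the same function once on the male scores and once on the female scores would give the 20 high and 20 low Ss of each sex.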

Experimental procedure.—The same three
practice words were used for each S and the mean
recognition threshold for the second two of these
was used as a practice measure for matching the
four groups, which were then randomly assigned
to the experimental conditions.² Following the
three practice words Ss were given the experi- 
mental instructions and 10 stimulus words for 
the conditions to which they had been assigned. 
The words were presented in random order for 
each S. Recognition thresholds and pre- 
recognition hypotheses were recorded. Ex- 
posure proceeded by .01 sec. from zero to .20 
sec., by .02 sec. from .20 to .30 sec., and by .03 
sec. beyond .30 sec. The point of origin was 
varied throughout successive presentations in 
an attempt to counteract any response sets built
up by S. Two consecutive correct responses 
constituted a recognition threshold which was 
recorded as the exposure time on the first of these 
responses. The reading was then converted to 
milliseconds for purposes of statistical treatment. 
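The exposure schedule and the threshold criterion described above can be sketched in a few lines. The sketch rests on stated assumptions: the series starts at .01 sec., the varied point of origin is ignored, and a response counts as correct only if it equals the stimulus word exactly. Function names are hypothetical.

```python
def exposure_series_ms(limit_ms=600):
    """Ascending exposure durations in milliseconds.

    Steps of 10 msec. up to 200, 20 msec. from 200 to 300, 30 msec. beyond 300.
    """
    series = []
    t = 10
    while t <= limit_ms:
        series.append(t)
        if t < 200:
            t += 10
        elif t < 300:
            t += 20
        else:
            t += 30
    return series


def recognition_threshold_ms(exposures_ms, responses, target):
    """Exposure on the first of two consecutive correct responses, else None."""
    prev_correct = False
    prev_exposure = None
    for exposure, response in zip(exposures_ms, responses):
        correct = response == target
        if correct and prev_correct:
            return prev_exposure  # exposure time on the first of the two
        prev_correct = correct
        prev_exposure = exposure
    return None


series = exposure_series_ms(360)
# Hypothetical pre-recognition hypotheses followed by two correct reports.
reports = ["", "house", "horse", "hose", "horse", "horse"]
threshold = recognition_threshold_ms(series, reports, "horse")
```

In this made-up run the single correct report at 30 msec. does not count, because the next report is wrong; the threshold is the 50-msec. exposure that begins the run of two correct reports.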

The instructions read to Ss were as follows: 
(a) General instructions (read for practice and 
no-set conditions) : “I am going to show you some 
words one at a time and I want you to tell 
me everything you see or think that you see as 
quickly as you can. I will show the words to 
you for only a brief moment but I want you to 
tell me everything that you see or think that you 
see. The words will appear where the X is now. 
When I call ready, please look at the spot where 
the words will appear. The same word will be 
shown until you have correctly identified it twice 
in a row. Any questions? Ready.” (b) Set 
instructions: “Some, but not all, of the words 
I’m going to show you now will be animal words, 
that is, words standing for animals. Your task 
is to recognize each word just as quickly as you 
can and report it to me. Remember, some but 
not all of the words will be animal words. Each 
word will be shown to you until you correctly 
identify it twice in a row. Any questions? 
When I call ready, please look at the spot where 
the words will appear. Ready.” 

Stimulus lists were constructed to be of homogeneous intralist frequency, that is, words within a given list were of essentially the same Thorndike-Lorge general word count (19). Each list consisted of four critical words (words consistent with set) and six neutral words (words not consistent with set). Length was held constant insofar as was possible. The practice words (medium frequency) were brick, straw, and lunch. The stimulus lists were as follows:


High-frequency              Low-frequency
critical    neutral         critical    neutral
chicken     among           gazelle     align
horse       certain         opossum     corrode
rabbit      refuse          raccoon     crimp
sheep       remove          sloth       fluster
            there                       humid
            under                       terse


² Five Ss were discarded in the process of matching the groups for sex and for high and low anxiety.


RESULTS 


For each S the mean recognition threshold for the four critical words and the six neutral words served as the basic data upon which an analysis of variance was performed. This analysis appears in Table 1. It may be seen that such variables as frequency, words, and words by set were all highly significant, which is consistent with a number of previous studies. Of more critical importance to the tests of the hypothesis of the present study, however, is the analysis of the interactions, particularly those involving set, anxiety, and frequency. A graphical representation of the Set X Frequency interaction, which approached statistical significance (F of 3.60; 3.98 required at .05 level), is presented in Fig. 1. The general effects of frequency (high vs. low) upon over-all thresholds are evident, and it is to be noted that the effect of set upon the critical words is more apparent in the case of the low-frequency material (t = 3.07, P = .01). For high-frequency material, the effect of set upon critical words was not significant although the effect of set was associated with significantly higher thresholds (t = 4.56, P < .01) for the neutral words. In short, it would seem that the effect of set in the direction of lower thresholds was much more pronounced for low-frequency material while making little if any difference in the case of high-frequency material, which corroborates a finding previously noted by the writers (3).


TABLE 1

Analysis of Variance of Mean Recognition Thresholds

Source                  F
Between Ss
  Anxiety (A)
  Set (S)
  Frequency (F)         19.45**
  A X S                 5.48*
  A X F
  S X F
  A X S X F
  Error
  Total
Within Ss
  Words (W)

MS (in printed order): 223.73, 8302.17, 2939.51, 677.24, 142.08, 4.11, 1059.72

* Significant at .03 level.
** Significant at <.01 level.


[Figure 1: mean recognition threshold (milliseconds) for high-frequency and low-frequency stimuli, with curves for neutral words and critical words.]

Fig. 1. Mean recognition thresholds for critical and neutral words as a function of set and frequency of usage of the stimuli.



















[Figure 2: mean recognition threshold for high-anxiety and low-anxiety Ss, with curves for neutral words and critical words under the set and no-set conditions.]

Fig. 2. Mean recognition thresholds for critical and neutral words as a function of set and anxiety level.


A graphical representation of the interaction of set and anxiety (F = 5.48, P = .03) is presented in Fig. 2. It should be noted that the effects of set for both critical and neutral words are differential as a function of anxiety level. This would be consistent with the failure to find any significant main effects of anxiety since the functions are, in fact, reversed. In the case of the critical words, set results in lower thresholds for the low-anxious Ss (t = 2.73, P = .01) while for the high-anxious Ss there is no significant difference in thresholds as a function of set. For the neutral words, there is no significant difference between set and no-set for the low-anxious Ss, while for the high-anxious Ss recognition thresholds were significantly lower (t = 2.00, P = .05) under the no-set condition. Apparently, set functioned in the expected manner with respect to the critical words for the low-anxious Ss while resulting in raised recognition thresholds for the high-anxious Ss in the case of the neutral words. In addition, when the recognition thresholds to critical words under the set condition are considered, there were no significant differences as a function of anxiety level although the thresholds to the neutral words under the set condition were significantly higher (t = 2.38, P = .03) for the high-anxious Ss.

The results of the set and anxiety 
interaction (for all stimulus words) 
may be generally summarized as 
indicating a trend toward higher 
thresholds for the high-anxious Ss 
under the set condition while lower 
thresholds result for the high-anxious 
Ss under the no-set condition. 


DISCUSSION


Clearly, the effects of set were not 
uniform at different anxiety levels. In 
terms of our original expectations, set 
and frequency both were postulated as 
associative variables defining the dif- 
ficulty of the perceptual task. That this 
assumption is justified in the case of 
frequency is indicated by the more or 
less consistent effects of this variable 
upon thresholds. However, in the case 
of set, the fact that thresholds were 
actually higher for the high-anxious Ss 
under this condition rather than lower 
as was originally expected would appear 
to take exception to the consideration of 
set as providing an increment to associ- 
ative strength, at least for the high- 
anxious Ss. The consideration of set 
as an associative variable (as originally 
conceived) could apply to the perform- 
ance of the low-anxious Ss, however. 

It is apparent that the inconsistency 
of the functioning of set at different 
anxiety levels must suggest some ad- 
ditional process in operation here. The 
set and no-set conditions may be con- 
sidered as two conditions differing in the 
degree of competitive response tendencies 
involved, being greater in the former 
than in the latter. That is, under the 
set condition, S is set to discriminate
between set and non-set (critical and 
neutral) words whereas this condition 
necessarily does not prevail under the 
no-set condition. Looking at the performance of the high-anxious Ss under
the set condition, then, which involves
competition of response tendencies, one 
sees that these Ss perform poorer (higher 
thresholds) than the low-anxious Ss. 
However, under the no-set condition 
(no competing response tendencies) the 
performance of the high-anxious Ss is 
superior (lower thresholds) to that of the 
low-anxious Ss. Such an interpretation 
of the results would tend to be consistent 
with the general findings of Spence and 
his associates (8, 12, 18) that in a com- 
petitive response-type situation, anxious 
Ss tend to perform poorer than non- 
anxious Ss, whereas they perform better 
in a simpler situation. 

Therefore, within the present experiment, the set condition, paradoxically, would have to be considered the more difficult behavioral situation, while the no-set condition would be simpler from the standpoint of competing response tendencies. Obviously, such a conception of the properties of set is quite contradictory to the traditional view which would contend that competing response tendencies or conflicting hypotheses would necessarily be reduced by set. That such a competitive situation does develop is quite feasible though, especially in light of our results which might be said to indicate a differential effect of competitive response tendencies upon performance, the effects being less in the case of the low-anxious Ss and greater in the case of the high-anxious Ss.

If the performance of the high- and low-anxious Ss under the set condition for the two types of words is considered, it may be noted that the difference in thresholds for the neutral words is much greater than the difference for the critical words as a function of anxiety level. This would seem to indicate that it is the difference in the recognition of the neutral words which most markedly distinguishes the performance of these two groups. Put still another way, it would suggest that among the effects upon perception of increasing the levels of motivation is the impairment of recognition of stimuli which are not appropriate or are “incidental” to the set.

In effect, the perception of these “incidental” stimuli is made less probable by increasing the level of motivation. An analogous finding in the field of learning has been obtained by Bruner, Matter, and Papanek (1) who demonstrated that by increasing levels of motivation, the range of cues to which the organism is attentive is reduced, thereby rendering less probable any learning incidental to the main task at hand. In our study, the performance of the high- and low-anxious Ss relative to the recognition of the neutral words would seem to reflect the operation of a similar mechanism involving impaired breadth of perceptual functioning with increasing levels of motivation.


SUMMARY 


An experiment was performed to test the hypothesis that perceptual recognition thresholds are a joint function of associative and drive strength. Eighty Ss selected on the basis of extreme scores on the Taylor A scale constituted high- and low-drive groups. Set of the Ss and frequency characteristics of the stimulus materials were manipulated as associative variables. The data were based upon tachistoscopic mean recognition thresholds for critical words (those congruent with set) and neutral words (those incongruent with set) and support the following conclusions:

1. The influence of Frequency, Words, and Words X Set were highly significant in the determination of recognition thresholds.

2. Set was found to interact with both frequency of the stimuli and anxiety of Ss.

3. The interaction of set and anxiety indicated that set facilitated the perceptual performance of the low-anxious Ss while impairing performance of the high-anxious Ss.

4. The greatest differences in performance between high- and low-anxious Ss were in the impaired recognition of neutral words incidental to the set by the high-anxious Ss.

Alternative interpretations of the properties of set under the present experimental conditions were discussed since details of the original hypothesis were not confirmed. The results, in general, were seen to be consistent with those of studies showing poorer performance for anxious Ss in a competitive response-type situation. Portions of the results were also interpreted from the standpoint of impaired breadth of perceptual functioning associated with high levels of drive.


REFERENCES


REFERENCES 


1. Bruner, J. S., Matter, J., & Papanek, M. L. Breadth of learning as a function of drive level and mechanization. Psychol. Rev., 1955, 62, 1-10.

2. Farber, I. E., & Spence, K. W. Complex learning and conditioning as a function of anxiety. J. exp. Psychol., 1953, 45, 120-125.

3. Freeman, J. T., & Engler, J. Perceptual recognition thresholds as a function of multiple and single set and frequency of usage of the stimulus material. Percept. Mot. Skills, 1955, 5, 149-154.

4. Holtzman, W. H., Calvin, A. D., & Bitterman, M. E. New evidence for the validity of Taylor’s Manifest Anxiety Scale. J. abnorm. soc. Psychol., 1952, 47, 853-854.

5. Howes, D. H., & Solomon, R. L. Visual duration thresholds as a function of word probability. J. exp. Psychol., 1951, 41, 401-410.

6. Hull, C. L. Principles of behavior. New York: D. Appleton-Century, 1943.

7. McGinnies, E., Comer, P. B., & Lacey, O. L. Visual recognition thresholds as a function of word length and word frequency. J. exp. Psychol., 1952, 44, 65-69.

8. Montague, E. K. The role of anxiety in serial rote learning. J. exp. Psychol., 1953, 45, 91-96.

9. Postman, L. The experimental analysis of motivational factors in perception. In Current theory and research in motivation: a symposium. Lincoln: Univ. of Nebraska Press, 1953.

10. Postman, L., & Bruner, J. S. Multiplicity of set as a determinant of perceptual behavior. J. exp. Psychol., 1949, 39, 369-377.

11. Postman, L., & Crutchfield, R. S. The interaction of need, set and stimulus structure in a cognitive task. Amer. J. Psychol., 1952, 65, 196-217.

12. Ramond, C. K. Anxiety and task as determiners of verbal performance. J. exp. Psychol., 1953, 46, 120-124.

13. Solomon, R. L., & Postman, L. Frequency of usage as a determinant of recognition thresholds for words. J. exp. Psychol., 1952, 43, 195-201.

14. Spence, K. W., & Farber, I. E. Conditioning and extinction as a function of anxiety. J. exp. Psychol., 1953, 45, 116-119.

15. Spence, K. W., & Taylor, J. A. Anxiety and strength of the UCS as determiners of the amount of eyelid conditioning. J. exp. Psychol., 1951, 42, 183-188.

16. Taylor, J. A. The relationship of anxiety to the conditioned eyelid response. J. exp. Psychol., 1951, 41, 81-92.

17. Taylor, J. A. A personality scale of manifest anxiety. J. abnorm. soc. Psychol., 1953, 48, 285-290.

18. Taylor, J. A., & Spence, K. W. The relationship of anxiety level to performance in serial learning. J. exp. Psychol., 1952, 44, 61-64.

19. Thorndike, E. L., & Lorge, I. The teacher’s word book of 30,000 words. New York: Columbia Univ. Press, 1944.

20. Welch, L., & Kubis, J. The effect of anxiety on the conditioning rate and stability of the PGR. J. Psychol., 1947, 23, 83-91.

21. Wispé, L. G., & Drambarean, N. C. Physiological need, word frequency, and visual duration thresholds. J. exp. Psychol., 1953, 46, 25-31.


(Received June 17, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


RESPONSE GENERALIZATION AS A FUNCTION OF
INTRATASK RESPONSE SIMILARITY ¹


MERRILL E. NOBLE ² AND HARRY P. BAHRICK ³


The Ohio State University          Ohio Wesleyan University


It has frequently been pointed out (7, p. 309) that a progressive restriction in the variability of responses characterizes certain aspects of the learning process. Only rarely, however, do we find a detailed description of response distributions to specify the nature of these changes. Investigators are usually careful in devising procedures for the detailed control and description of independent variables, but they are often content to employ relatively gross response indices as dependent variables (3). Examples of such gross response measures are the all-or-none records of key-pressing or the right-wrong scoring of verbal responses. Perhaps this practice reflects the belief that the most immediate problems of the behavioral sciences lie in the specification of important antecedent conditions of behavior, and that emphasis upon a more analytical examination of response changes must await the refinement of theory.

It can be argued, however, that even at their present stage of development behavioral theories are not well tested because of the relative lack of precision used in the measurement of responses. This view suggests that studies employing a more analytic description of responses might permit a clarification or resolution of some theoretical controversies. One area in which such an approach might be especially fruitful concerns facilitative and interference effects due to response similarity. Several somewhat different viewpoints are held regarding the effects of intratask response similarity (3, 8, 9, 13). These viewpoints are based for the most part upon the results of studies in which gross response indices are used, and in which response similarity is varied simply along an ordered scale, but not specified along a continuous equal-interval scale.

¹ This study was conducted in the Laboratory of Aviation Psychology of The Ohio State University, through support of Contract No. AF 33(038)-10528 between the Air Force Personnel and Training Research Center and The Ohio State University Research Foundation.

² Now at Kansas State College.

³ The authors are indebted to Drs. Paul M. Fitts, David Bakan, and Benton J. Underwood for many constructive criticisms.


The present paper reports an investigation in which continuous response distributions were obtained for two versions of a task. The two versions represented different intratask response similarities. Response similarity was itself defined by reference to an interval distance along a continuous scale. This made it possible to specify the variance of the response distributions to each stimulus, the constant errors, and the average amount of information transmitted per response. This last measure also reflects the amount of overlap among response distributions.


Part I
Method


The study was conducted in two parts. In Part I a psychophysical scale of force similarity was constructed. In Part II response distributions were obtained under two conditions of similarity among the required responses.

Apparatus.—The apparatus consisted of a semirigid steel control stick, mounted as shown in Fig. 1, and an indicator sensitive to .001 in. of movement. The indicator was placed in such a position that each mark on the scale represented ¼-lb. force exerted on a handle by S. This system was approximately isometric, with .1 in. of movement resulting when a force of 50 lb. was exerted on the stick handle.

[Figure 1: drawing labeled Pressure Gauge and Control Handle.]

Fig. 1. Schematic drawing of the pressure control.

Subjects.—Ten right-handed male students at 
Ohio Wesleyan University were employed on an 
hourly basis. Each S served under each of the 
10 conditions of force. 

Forces.—Ten forces, 1, 2, 3, 5, 10, 15, 20, 30, 35, and 40 lb., were used.

Procedure.—Each S made a total of 80 responses to a single one of the 10 forces during each of ten 4-hr. sessions. The 10 sessions were separated from one another by at least 24 hr. The sequence of forces required of each S in successive sessions was counterbalanced by means of a random latin square. The S was blindfolded and instructed to push a stick handle slowly when E said “now” until he thought the desired force was attained, and to then release the handle and relax until the next response. The S was told he would have to learn to exert the desired amount of force by trial and error. After each response E recorded the amount of force to the nearest ¼ lb. and told S his error in ¼-lb. units, e.g., if the desired force was 10 lb. and S exerted 11-lb. force he was told “four units over.” There was a 15-sec. rest between responses.
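Two parts of this procedure lend themselves to a short sketch: the counterbalanced order of forces across sessions and the knowledge of results given in quarter-pound units. The code below is an illustration under assumptions, not the authors' method. The Latin square is built by shuffling the rows, columns, and symbols of a cyclic square, which gives a valid but not uniformly sampled square; the wording of the feedback string and the "correct" label for a zero error are hypothetical.

```python
import random

FORCES_LB = [1, 2, 3, 5, 10, 15, 20, 30, 35, 40]


def random_latin_square(items, rng):
    """Latin square from a cyclic square with shuffled rows, columns, symbols.

    Row i can be read as the order of forces over sessions for the i-th S.
    """
    n = len(items)
    rows = list(range(n))
    cols = list(range(n))
    symbols = list(items)
    rng.shuffle(rows)
    rng.shuffle(cols)
    rng.shuffle(symbols)
    return [[symbols[(r + c) % n] for c in cols] for r in rows]


def feedback(desired_lb, exerted_lb):
    """Error reported in quarter-pound units, e.g. '4 units over'."""
    # Forces are recorded to the nearest quarter pound before comparison.
    units = round(exerted_lb * 4) - round(desired_lb * 4)
    if units == 0:
        return "correct"  # hypothetical wording for a zero error
    direction = "over" if units > 0 else "under"
    return f"{abs(units)} units {direction}"


square = random_latin_square(FORCES_LB, random.Random(0))
message = feedback(10, 11)
```

With ten Ss and ten forces, each force then appears once in every session position and once for every S, which is the property that counterbalancing requires.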


Results 


Learning curves plotted for successive blocks of 10 responses indicated that there was no appreciable learning beyond 10 trials on any stimulus. Accordingly, means and SD’s of the distribution of responses were determined for each S for each force for Trials 11-80. Inspection of these data indicated that constant errors were small and were normally distributed among Ss for each force. Figure 2 shows an empirical gradient of response generalization for a stimulus of 10 lb. The responses are pooled for all Ss. Equal discriminability units on the abscissa are obtained from Equation 1 by a procedure described below.

[Figure 2: abscissa, Similarity of Obtained to Correct Response (Correct − Obtained in Pounds), also marked in Equal Discriminability Units.]

Fig. 2. An empirical gradient of response generalization for a response of 10 lb.

The average SD for each distribution of responses was computed as the square root of the within-groups mean square for each of the 10 forces. The 70 responses of each S to each stimulus were treated as a group, so that the SD’s include only variations within Ss, but not variation between Ss. Figure 3 shows the relation between the required force and the ratio of the average SD to the required force. Also shown in Fig. 3 are similar data from a study by Jenkins (5) in which Ss attempted to reproduce a given experienced force by pushing against a semirigid control stick. Examination of these data indicates that the Jenkins study and the present one yielded functions of a similar type although Jenkins’ Ss had less variance than Ss of the present study. The relation between the average SD and the amount of required force is shown in Fig. 4. Inspection indicated that this relation approximates a straight-line function expressed by the equation


SD = .1p + .54          (1)


where SD is the square root of the within-group mean square in pounds and p is the force in pounds.
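The average SD described in this paragraph, the square root of the within-groups mean square with each S's responses to one force treated as a group, can be sketched as follows. The function name and the example data are hypothetical; the computation is the standard pooled within-group estimate, which is assumed here to be what the authors computed.

```python
from math import sqrt


def pooled_within_sd(responses_by_subject):
    """Square root of the within-groups mean square for one required force.

    responses_by_subject: one list of exerted forces (lb.) per S.
    Deviations are taken from each S's own mean, so differences between
    Ss in average force do not enter the estimate.
    """
    ss_within = 0.0
    df_within = 0
    for responses in responses_by_subject:
        n = len(responses)
        if n < 2:
            continue  # a single response carries no within-S variation
        mean = sum(responses) / n
        ss_within += sum((x - mean) ** 2 for x in responses)
        df_within += n - 1
    if df_within == 0:
        raise ValueError("need at least one S with two or more responses")
    return sqrt(ss_within / df_within)


# Two made-up Ss whose means differ by 2 lb. but whose spread is identical.
example_sd = pooled_within_sd([[9.0, 10.0, 11.0], [11.0, 12.0, 13.0]])
```

In the made-up example the 2-lb. difference between the two Ss' means leaves the result unchanged, which is the sense in which the SD's include variation within Ss but not between Ss.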

A scale of force similarity was then determined. Forces were selected in such a way that successive forces were separated by the square root of the sum of squares of their respective average SD’s. The first force was arbitrarily chosen to be 1 lb. The remaining forces were determined by solving appropriate quadratic equations, using as constants the values obtained from Equation 1. The resulting scale approximates an equal-interval scale, since there should be approximately an equal percentage of overlap of response distributions for all adjacent forces. The values obtained for the scale are shown in Table 1.

[Figure 3: abscissa, Standard Pressure in Pounds.]

Fig. 3. Average SD per required pressure as a function of pressure.

[Figure 4: abscissa, Standard Pressure in Pounds.]

Fig. 4. Average SD as a function of pressure.
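The scale construction can be illustrated with a short sketch. It assumes that Equation 1 has slope .1 and intercept .54, and that each successive force is the positive root of the quadratic obtained by setting the separation d equal to the square root of SD(p)² + SD(p + d)². The function names are hypothetical.

```python
from math import sqrt

SLOPE = 0.1       # assumed coefficient of p in Equation 1
INTERCEPT = 0.54  # assumed constant term in Equation 1


def average_sd(p):
    """Average SD (lb.) at required force p (lb.) under the assumed Equation 1."""
    return SLOPE * p + INTERCEPT


def next_force(p):
    """Force one scale unit above p.

    Solves d**2 = s**2 + (s + SLOPE*d)**2 for d > 0, where s = average_sd(p),
    i.e. (1 - SLOPE**2)*d**2 - 2*SLOPE*s*d - 2*s**2 = 0.
    """
    s = average_sd(p)
    a = 1 - SLOPE ** 2
    b = -2 * SLOPE * s
    c = -2 * s ** 2
    d = (-b + sqrt(b * b - 4 * a * c)) / (2 * a)
    return p + d


def force_scale(first=1.0, n=5):
    """First n scale values, starting from the arbitrarily chosen first force."""
    scale = [first]
    while len(scale) < n:
        scale.append(next_force(scale[-1]))
    return scale


scale = force_scale(1.0, 5)
```

Starting from 1 lb., the next four values produced under these assumptions fall within about .01 lb. of the scale values 1.98, 3.10, 4.40, and 5.89 listed in Table 1.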


TABLE 1

The Seven Required Forces

                                            Force in Pounds

Scale Values from Part I   1.98   3.10   4.40   5.89   7.63   9.63   11.80   14.53   17.63   21.13
Condition N                              4.50   6.00   7.75   9.75   12.00   14.50   17.75
Condition W                       3.00          6.00          9.75           14.50           21.25










Part II
Method 


Apparatus.—The apparatus was the same as 
that used in Part I. 

Subjects.—Fifty right-handed male students 
at The Ohio State University were divided by 
means of a table of random numbers into two 
groups of 25 Ss each. 

Forces.—Two sets of seven forces each were 
selected from the scale values determined in 
Part I. For Cond. N (narrow spacing) suc- 
cessive forces were separated from one another 
by one scale unit; for Cond. W (wide spacing) 
successive forces were separated from one 
another by two scale units. Three forces, in- 
cluding the median force for each set, were 
common to both conditions. The seven forces,
rounded to the nearest ¼ lb., for each condition
are given in Table 1. 
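The rounding of scale values to the nearest quarter pound, and the expression of a force in the quarter-pound units used for knowledge of results in Part I, can be sketched as below. The function names are hypothetical, and the scale values used in the example are those legible in Table 1.

```python
def to_quarter_units(force_lb):
    """Force expressed as a whole number of quarter-pound units."""
    # Python rounds exact halves to the even integer; none occur in the example.
    return round(force_lb * 4)


def round_to_quarter_lb(force_lb):
    """Force rounded to the nearest quarter pound."""
    return to_quarter_units(force_lb) / 4


scale_values = (4.40, 5.89, 7.63, 9.63, 14.53, 17.63)
rounded = [round_to_quarter_lb(v) for v in scale_values]
units = [to_quarter_units(f) for f in (6.0, 9.75, 14.5)]
```

On these inputs the rounded forces are 4.50, 6.00, 7.75, 9.75, 14.50, and 17.75 lb., and forces of 6, 9.75, and 14.5 lb. correspond to 24, 39, and 58 quarter-pound units.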

Procedure.—Preliminary practice was given on the central force
(9.75 lb.). The S was blindfolded and told to exert force slowly
against the handle of the semirigid stick when E called out the
stimulus number “39.” The S was instructed that the stimulus
represented an arbitrary unit, and that he would have to learn by
trial and error the amount of force required. Knowledge of results
was given after each response as in Part I. The preliminary practice
continued until S responded with a force between 8 and 11 lb.
Following this, each S made a total of 280 responses in a single
session, 40 responses with each of the seven stimuli of the condition
to which he had been assigned. Within this restriction the 280
responses were randomized by using three different random orders of
stimuli for each group of Ss. There was an 8-sec. rest between
responses.

For both conditions the stimulus for each response was the amount of
force to be exerted, called out by E in ¼-lb. units. Thus, if the
required force was 6 lb., E called out “24”; for a force of 9.75 lb.,
E said “39”; etc. The S was blindfolded and, as before, told only
that the stimuli represented arbitrary units. Knowledge of results
was given after each response as in Part I.

It should be noted that the stimuli for Cond. N and W were different,
i.e., for each condition the stimulus was a number representing the
particular amount of force to be exerted in ¼-lb. units. It is
possible that this difference in the stimuli of the two conditions
could affect the results. That is to say, even though for both
conditions the stimuli were highly discriminable from one another, it
is possible that S adjusted the error he would tolerate to the amount
of separation between adjacent stimuli instead of having a constant
set to respond as closely as possible to the required force. Since
for Cond. W the required forces were further apart than for Cond. N,
a flexible error tolerance would tend to result in larger variances
for Cond. W than for Cond. N. Consequently, a group of 15
right-handed Ss was added at a later date in order to test this
effect. The stimuli for this control condition (Cond. C) were
identical to those of Cond. W; the required responses were those of
Cond. N. Knowledge of results was given as in Cond. N, i.e., the
number of ¼-lb. units over or under the forces desired in Cond. N.
Otherwise, the procedure was the same as before.
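The stimulus coding and the knowledge of results described above can be illustrated with a minimal sketch. The function names are hypothetical, and the wording returned for an exactly correct response is an assumption, since the text reports only the "over" and "under" forms.

```python
def stimulus_number(force_lb: float) -> int:
    """Convert a required force in pounds to quarter-pound units,
    the number called out by E (e.g., 6 lb. -> 24)."""
    return round(force_lb * 4)


def knowledge_of_results(required_units: int, response_units: int) -> str:
    """Return E's feedback: the number of quarter-pound units by which
    the response was over or under the required force."""
    error = response_units - required_units
    if error > 0:
        return f"{error} over"
    if error < 0:
        return f"{-error} under"
    return "correct"  # assumed wording; not stated in the text


print(stimulus_number(6))            # required force of 6 lb.
print(stimulus_number(9.75))         # the central force
print(knowledge_of_results(39, 55))  # a response of "55" to stimulus "39"
```

With this coding, a response of "55" to the stimulus "39" yields the feedback "16 over," the example used later in the Discussion.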


Results 


For both conditions the mean 
constant error (CE) and the square 
root of the within-group mean square 
were computed for successive 5-trial 
blocks. The SD’s again include only 
variance within Ss, not variance 
between Ss. These data are pre- 
sented in Table 2. Inspection of the 
data shows that learning resulted 
primarily in a reduction in the magni- 
tude of the CE’s, and to a much lesser 
extent a reduction in the variability 
around these CE’s. Early in training 
there is a pronounced range effect for 
both groups, i.e., a tendency to 
respond with too much force to the 
lowest desired responses, and with too 
little force to the highest desired 
responses. This tendency has been 
observed by other investigators (4), 
but it is noteworthy here that this 
effect is transitory, and with con- 
tinued practice the direction of the 
CE tends to become positive even for 
the higher forces, i.e., Ss tend to exert 
more than the required amount of 
force. Of greatest interest are the data for the three stimuli
common to both groups: “24,” “39,” and “58” quarter-pounds.
Inspection of Table 2 indicates that little or no learning took place
after the initial 20 responses to these stimuli for both groups, and
during the final 20 responses to these stimuli the CE’s and
within-group variances are relatively stable. The CE’s and mean
within-group variance for these last 20 responses to the three common
stimuli were computed for both groups and are shown in Table 3.
t tests show that the mean within-group variance is significantly
(p < .01) larger for Cond. W than for Cond. N for all three forces.
Variability of variance among the 25 Ss of each group was used as a
basis for the error terms. Condition W also results in significantly
larger constant errors (p < .01) than Cond. N.

The CE’s and mean within-group variances for the last 20 responses to
each stimulus were also computed for the control group. None of the
CE’s or variances differs significantly (p > .05) from those obtained
for Cond. N.


TABLE 2

Mean Response and SD for Successive Blocks of Five Responses
(Condition N and Condition W; Desired Force in Quarter-Pounds)

[Table body not recoverable from the source.]


TABLE 3

Comparison of the Three Response Distributions Common to Conditions N
and W, Based on the Last 20 Responses to Each Force

Force (in Quarter-Pounds)
Cond. N: Mean, SD, Variance, SD
Cond. W: Mean, SD, Variance, SD

[Table values not recoverable from the source.]


The average amount of response information (I_r), response
equivocation (E_r), and information transmitted (I_t) were computed
for Cond. N and W. For these computations a 7-column (representing
the seven required forces) by 165-row (representing forces actually
exerted, in ¼-lb. units) matrix was used. The data for all Ss in a
given condition were entered in the matrix, i.e., the distributions
of the last 20 responses to each of the seven required forces were
pooled for all Ss. This procedure, which results in a slightly
inflated estimate of E_r because of pooling of separate S’s response
distributions, was followed because of the limited number of
responses per S and because it permitted a different comparison of
the two conditions based upon group performance and ignoring
individuals. The results of this analysis are shown in Table 4.
Reference to Table 4 shows that the greatest amount of information
was transmitted under Cond. W. Thus, even though Cond. W resulted in
the larger CE’s and the larger within-groups variance per individual
response distribution, this result indicates that there was less
overlap between adjacent response distributions for this condition
than for Cond. N.


TABLE 4

Informational Analysis of the Data

Cond.    I_r (Bits)    E_r (Bits)    I_t (Bits)
N        6.010         4.920         1.090
W        6.775         5.188         1.587
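The informational analysis can be sketched as follows. This is a minimal illustration, not the authors' computation. It assumes the usual definitions: I_r is the entropy of the pooled response distribution, E_r is the entropy of responses given the stimulus, and I_t = I_r - E_r. These definitions are consistent with Table 4, where 6.010 - 4.920 = 1.090 and 6.775 - 5.188 = 1.587. The function names and the toy matrices are hypothetical.

```python
import math


def entropy_bits(counts):
    """Shannon entropy, in bits, of a distribution given as raw counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def information_analysis(matrix):
    """matrix[r][s] is the number of responses in row (response category) r
    made to column (required force) s. Returns (I_r, E_r, I_t) in bits."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    total = sum(sum(row) for row in matrix)
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(matrix[r][s] for r in range(n_rows)) for s in range(n_cols)]
    # Response information: entropy of responses pooled over all stimuli.
    i_r = entropy_bits(row_totals)
    # Response equivocation: entropy of responses within each stimulus,
    # weighted by how often that stimulus was presented.
    e_r = sum(
        (col_totals[s] / total) * entropy_bits([matrix[r][s] for r in range(n_rows)])
        for s in range(n_cols)
        if col_totals[s] > 0
    )
    return i_r, e_r, i_r - e_r


# Toy case 1: two stimuli whose response distributions do not overlap.
print(information_analysis([[10, 0], [0, 10]]))
# Toy case 2: two stimuli whose response distributions overlap completely.
print(information_analysis([[5, 5], [5, 5]]))
```

In the first toy case all of the response information is transmitted; in the second none is, because the responses carry no information about which force was required.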


Discussion 


The response distributions obtained 
in Parts I and II are interpreted by the 
authors as empirical gradients of response 
generalization. However, since our ex- 
perimental situation is not the type 
traditionally used for the observation of 
response generalization, it is necessary 
to indicate the ways in which the procedures followed here conform to the 
essential operations that define response 
generalization. 

Response generalization is said to exist 
when “A given stimulus condition, once 
connected with a particular response will 
also elicit other responses which are 
related in some way to the first or trained 
response” (6, p. 326). In the traditional 
method used for the study of response 
generalization, a particular stimulus is 
presented a number of times during a 
training series and one specific response 
to this stimulus is reinforced (the rein- 
forcement may involve merely the pres- 
entation of the correct response on a 
memory drum). Other (“incorrect”) 
responses to the stimulus usually occur 
during this training period, but these 
responses supposedly are not reinforced. 
At some arbitrary point in time the 
training is terminated and measurements 
are made of the frequency of responses 
similar to the one reinforced during 
training (13, 14). 

In comparing this traditional pro- 
cedure with the one used in the present 
study, it is seen that the present study 
lacked the initial controlled training 
period during which positive reinforce- 
ment is given to a specific stimulus- 
response combination. Instead, in the 
present study a large number of similar 
responses occurred to a given stimulus on 
different trials. All of these responses 
may seem to have been equally reinforced. 
It should be noted with respect 
to the typical procedure, however, that a 
variety of slightly different responses 
will occur during all training situations, 
though the response deviations that are 
observed may be only relatively large 
ones. That is to say, the response units 
usually employed in discrete S-R tasks 
are relatively gross, and indeed are often 
of an all-or-none character. Thus, a 
specific key is, or is not, pressed; a 
specific word is, or is not, spoken. 
Minor variability, such as in the force 
applied to depress a key, is usually not 
observed, and therefore a variety of 
responses will be reinforced. Only gross 
errors are recorded in such situations and 
these are not reinforced. Such errors 
would occupy the tails of the response 
distributions in the present study, or 
they would occur through secondary or 
mediated generalization. 

Examination of the data of the present 
study provides some evidence also that 
not all responses are equally reinforced, 
since there was a progressive change 
toward the response designated as cor- 
rect, at least during the early part of 
training. When an incorrect response occurred (e.g., a response of
“55” to a stimulus of “39”) E’s statement of “16 over” did not
directly strengthen this response; instead, S would tend to use less
force on the next such trial, thus bringing his response closer to
the desired value. Nevertheless, the most important feature of these
data is that a variety of responses were produced to each stimulus
throughout training, despite differential reinforcements. The
resulting gradient of responses can be attributed to a partial
failure of discrimination between the specific correct response and
the other similar responses.


If generalization and discrimination are 
considered to be at opposite poles of a 
continuum, this is tantamount to stating 
that generalization occurred, and in an 
amount inversely related to the similarity 
between the correct and the generalized 
response. With this view, response gen- 
eralization can be observed during the 
acquisition training of any response by 
noting the occurrence of existing response 
variations. This has already been suggested 
by Brogden (2, p. 590) and has 
formed the basis of an earlier study by 
Arnold (1). 

Reference to the data of discrete S-R 
studies suggests much greater differences 
between the correct and generalized 
responses than were observed in the 
present study. A precise comparison is 
not possible, of course, without com- 
parable continuous scales of response 
similarity. It seems likely, however, 
that the generalized responses of discrete 
S-R tasks often reflect instances of 
secondary or mediated generalization, 
while the present data and those obtained 
by Arnold exemplify primary response 
generalization. 

In this connection, it is of interest to 
compare the other results of this study 
with those of studies using discrete S-R 
tasks. The results here show that 
practice in making the more similar 
responses (Cond. N) resulted in smaller 
variances and CE’s for the response 
distributions than was the case when the 
less similar responses (Cond. W) were made. Furthermore, a comparison
of the SD’s obtained in Part II with those predicted from Equation 1
from the data of Part I, shows that the predicted values for Cond. N
closely approach the obtained values for five of the seven response
distributions (for the two lowest forces the obtained SD’s were
larger than predicted). This was not the case for Cond. W, however.
There, the obtained SD’s were all substantially larger than the
predicted values. The failure 
to find significant differences between the 
variances or the CE’s for Cond. N and C 
suggests that these results cannot be 
attributed to an increase in amount of 
separation among the already highly 
discriminable stimuli of the tasks. The 
informational analyses showed that the 
amount of overlap among response 
distributions was greatest for the con- 
dition having the greater response simi- 
larity, despite the finding that an increase 
in intratask response similarity led to 
decreased response variability. 

As noted earlier, “minor” variability usually is ignored in discrete
S-R tasks, and only such gross errors as calling out an entirely
different word or pressing another button are recorded. Such studies
have shown that when stimuli are held constant an increase in
intratask response similarity results in an increased number of
intrusion errors (10, 11, 12). These findings are often accounted for
by means of the concept of response generalization in one form or
another (3). The generalized response strength is said to be
inversely related to the degree of similarity between the generalized
and the reinforced response. If this assumption is applied to the
data here it would suggest that the overlap of response distributions
should be a simple direct function of the similarity among the
required responses of the task. The present results, however, show
that the distribution of responses around designated response points
becomes less variable as response similarity is increased. If this
result is a general one, 
it suggests that intrusion errors due to 
primary response generalization are not 
a simple function of response similarity, 
since the shape of the generalization 
gradients themselves is also affected by 
the degree of response similarity. In 
order to determine whether the present 
results have generality, it will, of course, 
be necessary to develop other tasks in 
which response similarity can be specified 
along a continuous scale. 

The results obtained from the information analysis have a bearing
upon the problem of coding of responses. They suggest that for
optimum coding the range of responses should be extended so that the
scale interval between response categories is maximized. The
informational analyses suggest that even with optimum spacing of
pressure responses between 1 and 30 lb., only about three absolutely
discriminable categories of responses are possible if the responses
of several Ss are placed in a single distribution. Information
transmission for the responses of single individuals would doubtless
be greater.
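The figure of "about three" discriminable categories can be checked against Table 4 with a short calculation. This sketch assumes the conventional reading that I_t bits of transmitted information correspond to 2 raised to the power I_t perfectly discriminable categories; the function name is hypothetical.

```python
def discriminable_categories(bits_transmitted: float) -> float:
    """Number of perfectly discriminable categories equivalent to a
    given amount of transmitted information, in bits."""
    return 2 ** bits_transmitted


# Information transmitted (I_t) for each condition, taken from Table 4.
for cond, i_t in (("N", 1.090), ("W", 1.587)):
    print(f"Cond. {cond}: {discriminable_categories(i_t):.2f} categories")
```

On this reading Cond. W, the more widely spaced condition, corresponds to roughly three categories and Cond. N to slightly more than two.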


SUMMARY 


This study points out the advantages of obtaining distributions of
responses permitted to vary along a scaled continuum in order to
determine the effects of response similarity upon response
generalization.

The Ss were trained to exert different amounts of force on a
semirigid control in response to number stimuli. Intratask similarity
was varied by employing two versions of the task, differing in the
degree of separation between required adjacent force responses.
Responses were measured on a continuous scale and response
distributions were obtained for several response points under both
conditions of response similarity. These distributions are
interpreted as empirical gradients of response generalization.

It is shown that generalization of individual responses is
significantly smaller for the condition of greater intratask response
similarity. This condition results in steeper generalization
gradients. Despite this effect adjacent response distributions show
more overlap when the required responses are spaced more closely. The
procedure and results of this study are compared with those of
discrete S-R tasks investigating the effects of response similarity.


REFERENCES

1. Arnold, W. J. An exploratory investigation of primary response
   generalization. J. comp. Psychol., 1945, 38, 87-102.

2. Brogden, W. J. Animal studies of learning. In S. S. Stevens (Ed.),
   Handbook of experimental psychology. New York: Wiley, 1951. Ch. 10.

3. Duncan, C. P. Development of response generalization gradients.
   J. exp. Psychol., 1955, 50, 26-30.

4. Ellson, D. G., & Wheeler, L., Jr. The range effect. Dayton, O.:
   USAF Air Materiel Command, Wright-Patterson Air Force Base, 1949.
   (AF Tech. Rep. No. 5813).

5. Jenkins, W. O. The discrimination and reproduction of motor
   adjustment with various types of aircraft controls. Amer. J.
   Psychol., 1947, 60, 397-406.

6. McGeoch, J. A., & Irion, A. L. The psychology of human learning.
   New York: Longmans, Green, 1952.

7. Skinner, B. F. The behavior of organisms: an experimental analysis.
   New York: Appleton-Century, 1938.

8. Skinner, B. F. Science and human behavior. New York: Macmillan,
   1953.

9. Underwood, B. J. Associative transfer in verbal learning as a
   function of response similarity and degree of first-list learning.
   J. exp. Psychol., 1951, 42, 44-53.

10. Underwood, B. J. Studies of distributed practice: II. Learning and
    retention of paired adjective lists with two levels of intralist
    similarity. J. exp. Psychol., 1951, 42, 153-161.

11. Underwood, B. J. Studies of distributed practice: VIII. Learning
    and retention of paired nonsense syllables as a function of
    intralist similarity. J. exp. Psychol., 1953, 45, 133-142.

12. Underwood, B. J. Studies of distributed practice: IX. Learning and
    retention of paired adjectives as a function of intralist
    similarity. J. exp. Psychol., 1953, 45, 143-149.

13. Underwood, B. J., & Hughes, R. H. Gradients of generalized verbal
    responses. Amer. J. Psychol., 1950, 63, 422-430.

14. Wickens, D. D. The transference of conditioned excitation and
    conditioned inhibition from one muscle group to the antagonistic
    muscle group. J. exp. Psychol., 1938, 22, 101-123.


(Received June 9, 1955)





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


A FAILURE TO FIND SECOND-ORDER SEMANTIC GENERALIZATION


IRVING MALTZMAN AND LLOYD O. BROOKS


University of California, Los Angeles


Probably the most extensive series of studies on semantic
generalization has been conducted by Cofer, Foley, and their
associates. Two of their studies provide what appears to be the most
unequivocal evidence for the hypothesis. One study reported the
occurrence of semantic generalization between synonyms, synonyms of
synonyms (second-order semantic generalization) and between two
different lists of homophones (2). Another study purported to show
semantic generalization between antonyms (1). A verbal transfer
method was utilized in both experiments.


The Ss were first given a buffer or practice list followed by a
written recall and then an equating list and its recall test. They
were then distributed among five different groups, each receiving
four presentations of a reinforcement list. Reinforcement lists
practiced by the experimental groups were related semantically to a
subsequent test list. The latter was presented only once and followed
by a written recall. Two groups received different lists of
homophones. The other experimental conditions were synonyms and
second-order synonyms. A control group received a reinforcement list
of words unrelated to the test list. The Ss were told that they would
not be asked to recall the reinforcement list. Since Ss were tested
for recall of the first two lists, however, the effectiveness of
these instructions for the reinforcement list is questionable.

In terms of mean test words recalled, only one of the homophone
groups did significantly better than the control group. However, in
terms of medians all four experimental groups differed significantly
from the control group. But our application of Shen's test (7) to the
data indicated that for the control group the mean would be a more
reliable measure than the median. In view of this finding, and the
lack of agreement between means and medians, the reliability of the
results of this experiment is highly questionable.


The experiment on semantic generalization between antonyms (1)
followed essentially the same procedure as the first study. All Ss
received a buffer and an equating list. An experimental group
received a reinforcement list of antonyms of the test words while a
control group received words unrelated to the test list. An important
variation from the first experiment was the addition of a second
control group which did not receive a reinforcement list, but was
given the test list immediately after the written recall of the
equating list.


Semantic generalization between antonyms was reported to have
occurred even though the mean difference between the experimental
group and the first control group was significant at only the 10%
level (two-tailed test). Mean recall of the second control group was
compared only with the first control group from which it did not
differ significantly. Inspection indicates that the second control
group likewise did not differ significantly from the experimental
group. Differences among the medians of the three groups were even
smaller than the mean differences, but their reliabilities were not
reported.

The absence of a significant difference between the experimental
group and the second control group contradicts the hypothesis that
semantic generalization occurred in the former group. If repetition
of the antonym reinforcement list produced increments in habit
strength which generalized to the test list, the test words should
have been recalled more readily by the experimental group than by the
control group that did not receive any reinforcement list. The
absence of any such difference implies that if a control condition
comparable to the second control group had been run in the first
experiment, evidence clearly contrary to the hypothesis of semantic
generalization would have been obtained here as well.
Despite the questionable nature of the differences obtained between
the experimental and control groups in these experiments, Osgood
(3, 4) has offered an hypothesis which he proposed as an alternative
to the semantic generalization interpretation of the data. He argues
that if experimental Ss recognize the presence of a semantic
relationship between the reinforcement and test lists they would be
able to reconstruct the test list, in part, from their recall of the
words of the reinforcement list. 
A control group that receives unrelated reinforcement 
words could not reconstruct the test 
list, and therefore would have a poorer recall 
score. It would be easier to reconstruct the 
test-list words from recall of their homophones 
than from synonyms, because of the greater 
number of alternative responses in the latter 
case. This would account for the greater 
apparent generalization between homophones 
than between synonyms found with the verbal 
transfer method. In contrast, greater gener- 
alization between synonyms has been found by 
the conditioning procedure (5, 6). 

Although Osgood’s hypothesis gives a satis- 
factory alternative explanation of the superior 
recall of the homophone and synonym conditions 
as compared with the control condition, provided 
that we assume that these differences are reliable, 
it cannot account for the superior recall of the 
second-order synonym condition. Reconstruc- 
tion of the test list from synonyms of their 
synonyms seems unlikely, because there is no 
apparent semantic relationship between the 
reinforcement and test words, for example, 
killed and sent. Even if this relationship were 
known, there are such a large number of alter- 
native responses possible, that it seems im- 
probable that reconstruction of the test list 
could occur. If reconstruction is not possible, 
then increased recall of the test list as a con- 
sequence of practice on synonyms of their 
synonyms would have to be attributed to the 
effects of semantic generalization. 


Second-order synonyms were there- 
fore chosen for study in the present 
experiment because positive results 
could be interpreted more readily than 


in the other semantic conditions. To 
further ensure that reconstruction of 
the test list could not occur, the 
present experiments were designed 
to reduce the possibility of Ss’ dis- 
covering the relationship between 
reinforcement and test words. The 
first experiment to be reported here 
was also designed to determine 
whether second-order semantic gen- 
eralization, if demonstrable, varies as 
a function of the number of rein- 
forcements. 


Experiment I 
Method 


Subjects.—The Ss were 120 students obtained 
from introductory psychology classes. There 
were 20 Ss randomly assigned in pairs to each 
of the six different conditions. 

Materials.—The reinforcement and test words 
employed were obtained from the lists given by 
Foley and Cofer (2). Each word was printed 
on a 2 X 2-in. slide and projected on a beaded 
screen by means of a Spindler and Sauppe 
Selectroslide projector. 

Procedure.—All Ss received a reinforcement 
series of 10 different slides followed after a 
1-min. rest period by a test series of 20 slides. 
Each reinforcement series presented to the 
experimental groups contained a word which 
was a synonym of a synonym of a critical word 
in the test list. Three experimental groups 
differing only in the number of repetitions of the 
reinforcement series were employed. These 
groups received 4, 8, and 20 repetitions, 
respectively. Foley and Cofer’s (2) groups 
received 4 repetitions of the reinforcement list. 
Three control groups receiving 4, 8, and 20 
repetitions of the reinforcement series were also 
employed. The reinforcement words for these 
groups were not related in any obvious manner 
to the critical or the noncritical test words. 
Duplicate slides were made of each word in the 
experimental and control practice lists. It was 
therefore possible to lay out a prearranged 
random order of slides for each repetition of the 
practice lists. This procedure eliminated the 
need for reshuffling the slides between repetitions 
in order to randomize their serial positions. 

The test list given to the experimental and 
control groups contained the 10 critical words 
employed by Foley and Cofer (2). Each of 
these words had a synonym of its synonym in 
the reinforcement list presented to the experi- 
mental groups. In addition, 10 noncritical 
words were randomly interspersed among the 
critical test words. The noncritical words were 
added to the test list to reduce the possibility 
of experimental Ss recognizing a semantic 
relationship between the reinforcement and test 
lists. The noncritical words were Christian 
names selected from Foley and Cofer’s (2) 
equating list. 

The Ss were run in pairs in all six conditions. 
Prior to the presentation of the reinforcement 
series they were told that the experiment was an 
investigation of the speed of perceptual recogni- 
tion. They were asked to call out the word as 
soon as it flashed on the screen, and were told that 
their speed of response would be recorded. Prior to 
the presentation of the test list Ss were in- 
structed not to call out, but to think of the 
meaning of each word. They were also told 
that they would be tested on their retention of 
this series. 

Following one presentation of the test series, 
Ss were given a sheet of paper and were asked 
to record as many of the test words as possible. There was no time
limit. They were then asked for a written recall of the words from
the reinforcement series. The Ss were questioned as to their
interpretation of the purpose of the experiment and as to the
presence of any relationship or similarity between the practice and
test lists.


TABLE 1

Mean Recall of Reinforcement Words

Experimental Groups: Mean, SD

[Table values not recoverable from the source.]


Results 


Table 1 shows the mean number of reinforcement words recalled by the
experimental and control groups. Hartley’s test (8) indicated that
the variances were homogeneous. An analysis of variance resulted in a
significant F for practice effects, F = 12.99, for 2 and 114 df,
P < .01. The differences between recall of the experimental and
control reinforcement lists and the interaction between amount of
practice and the kind of list were not significant.

Table 2 shows the mean recall of critical and noncritical test words
obtained by the experimental and control groups. Hartley’s test
indicated that the variances for the recall of noncritical test words
by the different groups were homogeneous. None of the F ratios for
the main effects or their interaction reached an acceptable level of
confidence.

Hartley’s test indicated that the variances for recall of the
critical test words were homogeneous. The analysis of variance for
the critical test words likewise failed to indicate any significant
main effects or interaction. The F for treatments was 1.05, df = 1
and 114. The F for practice level was .74, df = 2 and 114. These
results indicate the absence of second-order semantic generalization
effects.
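The degrees of freedom reported for these F ratios follow from the design: two kinds of reinforcement list (experimental and control), three practice levels (4, 8, and 20 repetitions), and 20 Ss per group. The sketch below is a minimal check of that arithmetic. It assumes a completely randomized two-factor design in which each S is treated as an independent observation, which agrees with the reported 114 df for error; the function name is hypothetical.

```python
def factorial_df(levels_a: int, levels_b: int, n_per_cell: int):
    """Degrees of freedom for a two-factor, between-subjects analysis of
    variance with equal cell sizes: (A, B, A x B, within cells)."""
    df_a = levels_a - 1
    df_b = levels_b - 1
    df_ab = df_a * df_b
    df_within = levels_a * levels_b * (n_per_cell - 1)
    return df_a, df_b, df_ab, df_within


# Treatments (2 levels) x practice (3 levels), 20 Ss per group, 120 Ss in all.
df_treatments, df_practice, df_interaction, df_error = factorial_df(2, 3, 20)
print(df_treatments, df_practice, df_interaction, df_error)
```

The treatment effect is thus tested on 1 and 114 df and the practice effect on 2 and 114 df, as stated in the text.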

None of the experimental Ss had 
any knowledge of the semantic rela- 
tionship existing between the rein- 
forcement words and the critical test 
words. However, a considerable number of experimental and control Ss
assumed that there was some form of association or relationship
between words of the reinforcement list and the noncritical words of
the test list, the Christian names. The failure to find a significant
treatment or practice effect in the recall of the test list may,
therefore, have been 
due to competing responses aroused 
by the set to look for associations 
between reinforcement words and the 
noncritical test words. Another pos- 
sible source of interfering effects may 
have been the competitive nature of 


the practice session. 


Experiment II 


The purpose of the second experiment was to determine whether
different results would be obtained when the test list contained only
critical words, as was the case in the Foley and Cofer study (2), and
when Ss were run individually. A second control group was added in
this experiment in order to determine whether the reinforcement lists
were producing proactive inhibition for recall of the test list.


TABLE 2

Mean Recall of Critical and Noncritical Test Words

[Table largely illegible in the source; legible entries under
Critical Test Words: 4.00, 4.58, 4.75, 4.61.]


TABLE 3

Mean Recall of Test Series

[Table layout not recoverable from the source; legible entries:
6.00, 6.31, 5.00, 6.02.]


Method 


Subjects.—Forty-five students from intro- 
ductory psychology classes were employed in 
this experiment, 15 in each of the two control 
groups and the one experimental group. 

Procedure.—The first control group (C4) and 
the experimental group (X4) were given four 
presentations of their respective reinforcement 
lists before the presentation of the 10 critical 
test words. The second control group (C0) 
was given the test list without prior exposure 
to a reinforcement list. The Ss were run 
individually. In all other details the procedure 
was the same as in the previous experiment. 


Results.—Table 3 shows the mean 
recall scores for the test list obtained 
by the three groups. The variances 
were homogeneous and the F ratio 
obtained from the simple analysis of 
variance was less than unity. 

These results indicate that the 
failure to find a difference in Exp. I 
between the experimental and control 
groups in their recall of the test list 
was not due to the presence of non- 
critical words in the test list, or the 
competitive social nature of the 
experimental situation. 


Discussion 


It may be argued, perhaps, that our failure to find second-order
semantic generalization was due to the low level of motivation for
learning the reinforcement list. The training conditions of these
experiments approximate those of 
incidental learning. The Ss in Cofer and 
Foley’s experiments, perhaps inadver- 
tently, were more highly motivated to 
learn the reinforcement list due to their 
experience of recall tests on the two prior 
lists. Recall of the reinforcement list in 
our first experiment, however, showed a 
significant increase in retention with 
increased training despite the relative 
lack of motivation to learn. Since the 
words of the reinforcement list acquired 
increments in habit strength, as indicated 
by the superior recall of the test list of 
the experimental groups receiving a 
large number of repetitions of the 
reinforcement list, generalization to syn- 
onyms of their synonyms should have 
occurred. 

The previous analysis of Cofer and 
Foley’s experiments on semantic gen- 
eralization and the results of the present 
experiments indicate that there is no 
sound basis for assuming that semantic 
generalization as manifested by verbal 
transfer occurs between homophones, 
synonyms, antonyms, or second-order 
synonyms. However, the possibility 
still remains that further increases in 
motivational level and number of rein- 
forcements might produce second-order 
semantic generalization, and other meth- 
ods may very well demonstrate these 
different kinds of semantic generalization
(5, 6).


SUMMARY


Analysis of previous experiments on semantic 
generalization by Cofer and Foley using the 
verbal transfer method indicated that the 
evidence for semantic generalization between 
homophones, antonyms, synonyms, and second- 
order synonyms lacks an acceptable degree of 
reliability. Apparent generalization among 
semantic relationships other than second-order 
synonyms may be interpreted in an alternative 
manner by Osgood’s reconstruction hypothesis. 
As a consequence, two experiments employing 
the method of verbal transfer were conducted in 
an attempt to obtain second-order semantic 
generalization. In the first experiment, experi- 
mental groups received varying amounts of 
practice on words which were synonyms of 
synonyms of test words. Control groups
received varying amounts of practice on words 
unrelated to the test words. Superior recall of
the critical test words by the experimental
groups would indicate the occurrence of generalization.
No difference between experimental
and control groups was obtained in their recall
of the test words. A second experiment designed
to eliminate possible uncontrolled variables
interfering with verbal transfer likewise failed
to provide any evidence of second-order semantic
generalization.


REFERENCES 


1. Cofer, C. N., Janis, M. G., & Rowell,
M. M. Mediated generalization and the
interpretation of verbal behavior: III.
Experimental study of antonym gradients.
J. exp. Psychol., 1943, 32, 266-269.

2. Foley, J. P., & Cofer, C. N. Mediated
generalization and the interpretation of
verbal behavior: II. Experimental study
of certain homophone and synonym gradients.
J. exp. Psychol., 1943, 32, 169-175.

3. Osgood, C. E. The nature and measurement
of meaning. Psychol. Bull., 1952, 49,
197-237.

4. Osgood, C. E. Method and theory in experimental
psychology. New York: Oxford
Univ. Press, 1953.

5. Razran, G. H. S. A quantitative study of
meaning by a conditioned salivary
technique (semantic conditioning).
Science, 1939, 90, 89-90.

6. Riess, B. F. Semantic conditioning involving
the galvanic skin reflex. J. exp.
Psychol., 1940, 26, 238-240.

7. Shen, E. Note on the sampling error of the
median. J. educ. Psychol., 1935, 26,
154-156.

8. Walker, H. M., & Lev, J. Statistical inference.
New York: Holt, 1953.


(Received May 27, 1955)








Journal of Experimental Psychology
Vol. 51, No. 6, 1956 


TASK INSTRUCTIONS FOR ANAGRAMS FOLLOWING 
DIFFERENT TASK INSTRUCTIONS 
AND TRAINING ¹


IRVING MALTZMAN, EUGENE EISMAN,² LLOYD O. BROOKS,
AND WILLIAM M. SMITH 


University of California at Los Angeles 


Previous studies by Rees and Israel 
(4) and by Maltzman and Morrisett 
(3) have demonstrated that task 
instructions may increase the dis- 
position or set for a given class of 
solutions to anagrams possessing mul- 
tiple solutions. The purpose of the 
present experiment was to determine 
whether task instructions would still 
have the effect of inducing a given 
set when prior instructions and train- 
ing have induced a different set. 


METHOD


Subjects.—Ninety-six volunteer Ss from 
introductory psychology classes served in the 
experiment. There were 12 Ss in each of the 
eight experimental subgroups. Each of the four 
Es conducted the experiment with a propor- 
tionate number of Ss in each condition. 

Procedure.—All eight subgroups first received
a training series of 20 five-letter anagrams each
typed in lower case on 3 × 5-in. white index
cards. There was only one solution for each
of these anagrams. Half of the 96 Ss received
training anagrams that could be solved by a
word referring to an aspect of eating or food.
For example, the anagram “ocnba” is solvable as
“bacon.” The anagrams were in no consistent
letter order. Prior to the start of the training
series, half of these 48 Ss received instructions
to look for words referring to eating or different
foods. This is the eating-instructions (EI)
training group. The other 24 Ss received no
special task instructions, thus forming the eating
(E) training group.

¹ This study was supported by a grant from
the Committee on Research, University of
California. A portion of this paper was presented
at the 1954 meetings of the American
Psychological Association in New York City.

² Now at the University of California,
Riverside.

The remaining 48 Ss also received training
anagrams with only one possible solution. It
was always in the same letter order, 32145.
The third letter in the word was the first in the
anagram, and so on. For example, the anagram
“tamch” is solvable as “match.” These words
were not related to eating or any other category
in any obvious consistent fashion. Prior to the
start of the training series, half of these 48 Ss
were given instructions to look for a given letter
order in the anagrams. They formed the order
instructions (OI) training group. The remaining
24 Ss received no special task instructions,
thus forming the order (O) training group.
Thus four different conditions of training were
employed, EI, E, OI, and O.
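The fixed letter order described above can be illustrated with a minimal sketch. It assumes that the digits of 32145 give, for each position in the anagram, the 1-based position of the letter in the solution word; the function names `scramble` and `unscramble` are hypothetical and are not taken from the original materials.

```python
def scramble(word, order="32145"):
    """Build an anagram: position k of the anagram takes the letter at the
    1-based index given by the k-th digit of `order` (assumed convention)."""
    if len(word) != len(order):
        raise ValueError("word and order must have the same length")
    return "".join(word[int(digit) - 1] for digit in order)


def unscramble(anagram, order="32145"):
    """Invert scramble(): put each anagram letter back at its original index."""
    if len(anagram) != len(order):
        raise ValueError("anagram and order must have the same length")
    solution = [""] * len(order)
    for position, digit in enumerate(order):
        solution[int(digit) - 1] = anagram[position]
    return "".join(solution)


if __name__ == "__main__":
    # The example from the text: "match" presented as "tamch".
    print(scramble("match"))
    print(unscramble("tamch"))
```

Under this reading, applying the order to "match" gives "tamch", which agrees with the example in the text.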

A series of test anagrams was introduced im- 
mediately after the training series. All Ss 
received the same test which consisted of 10 
five-letter anagrams each possessing several 
different words as solutions, one of which was an 
eating word. Each of the four training groups 
was divided into two subgroups for the test 
series. Half the Ss were instructed to look for 
eating words while half received no special task 
instructions. Thus eight different conditions 
were present in the test series. Three experimental
variables were employed: training
anagram, instructions prior to training, and
instructions prior to the test series. They
permitted a 2 × 2 × 2 factorial analysis of the
number of eating or set solutions obtained on the
multiple solution test anagrams.
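As an illustration only, the eight cells of this design can be enumerated as follows. The sketch assumes 12 Ss per cell, as stated under Subjects, and builds labels on the pattern of the group names used in the text (for example OI-EI and O-E); the function and variable names are hypothetical.

```python
from itertools import product

SS_PER_CELL = 12  # stated in the Subjects paragraph


def design_cells():
    """Enumerate the 2 x 2 x 2 design: training anagram class, instructions
    before training, and instructions before the test series."""
    cells = []
    for anagram, train_instr, test_instr in product(
        ("eating", "order"), (True, False), (True, False)
    ):
        training = ("E" if anagram == "eating" else "O") + ("I" if train_instr else "")
        test = "EI" if test_instr else "E"
        cells.append(
            {
                "training_anagram": anagram,
                "training_instructions": train_instr,
                "test_instructions": test_instr,
                "label": f"{training}-{test}",
                "n": SS_PER_CELL,
            }
        )
    return cells


if __name__ == "__main__":
    cells = design_cells()
    for cell in cells:
        print(cell["label"], cell["n"])
    print("total Ss:", sum(cell["n"] for cell in cells))
```

The eight cells of 12 Ss account for all 96 Ss, with 48 Ss at each level of every factor.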

Further details of the procedure and the 
instructions to Ss are similar to those in the 
experiment by Maltzman and Morrisett (3). 
The training and test anagrams were all obtained 
from the lists employed by Rees and Israel (4). 


RESULTS AND DISCUSSION


The mean number of eating-set 
solutions obtained in the test series 
by the different groups is shown in 
Table 1. Bartlett’s test indicated 
that the variances were homogeneous. 
A summary of the analysis of variance 
of the test data is given in Table 2. 
The analysis indicated that two of the
three main effects were significant:
the class of anagrams presented during
training and the nature of the instructions
given prior to the test series.
The nature of the instructions given
prior to the training series did not
significantly affect performance in the
test series. None of the first- or
second-order interactions were significant.

The significant F for training anagrams
indicates that a set for eating-word
solutions was established
through the effects of training, a
finding obtained in several other
studies (2, 4). The Ss in the four
subgroups receiving eating anagrams
during the training series had a mean
of 6.7 set solutions in the test series.
The Ss in the four subgroups receiving
practice in solving letter-order anagrams
obtained a mean of 5.9 set
solutions on the test series.


TABLE 1

Mean Eating-Set Solutions to Test Anagrams

[Table body only partly legible. Columns: Training Instructions (Order Instructions, Order Only, Eating Instructions, Eating Only), each with Mean and SD. Rows: Test Instructions (Eating instructions, Eating only). Legible entries, upper row then lower row: Order Only mean, 6.42 and 5.47; one SD column, 2.09 and 1.51; one further Mean column, 7.17 and 6.50.]


TABLE 2

Analysis of Variance of Set Solutions to Test Anagrams

[Numerical columns not legible. Sources of variation: Anagrams; Training instructions; Test instructions; Anagrams × Training instructions; Anagrams × Test instructions; Training instructions × Test instructions; Anagrams × Training instructions × Test instructions; Within groups. Footnotes mark the .05 and .01 levels of significance.]



The task instructions given prior to
the test series were the other variable
producing a significant effect. The
lack of significant interactions indicates
that whether or not training
anagrams and training task instructions
were congruent with the test
instructions, task instructions for
eating solutions given immediately
before the test produced a significant
increase in this class of solutions.

The Ss in the four subgroups receiving
test-task instructions had a
mean of 7.0 set solutions. Fifty-four
per cent of these Ss gave a set solution
on the first test anagram. The Ss in
the four subgroups without special
task instructions before the test
series obtained 5.7 mean set solutions.
Twenty-five per cent of these Ss gave
an eating solution on the first test
anagram. A chi square corrected
for continuity indicated that the
relative frequency of set solutions
obtained on the first test anagram by
these different subgroups is significant
(7.36, P < .01). These results indicate
that the difference in set
solutions obtained by the -EI and
-E groups cannot be attributed to
the acquisition of the set by the -EI
subgroups during the course of the
test series.

The immediacy of the effect of the
task instructions is further indicated
by a comparison of the different O-groups.
Fifty per cent of the Ss in
the OI-EI and O-EI groups gave a
set solution on the first anagram of
the test series while 12.5% of the Ss
in the OI-E and O-E groups gave a
set solution (χ² = 6.21, P < .02).
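The two chi-square values reported above can be checked with a minimal sketch of the continuity-corrected (Yates) statistic for a 2 × 2 table. The cell counts are assumptions derived from the reported percentages and group sizes (54% and 25% of 48 Ss taken as 26 and 12; 50% and 12.5% of 24 Ss taken as 12 and 3); they do not appear in the text, and the function name is hypothetical.

```python
def yates_chi_square(a, b, c, d):
    """Chi square with Yates' continuity correction for the 2 x 2 table
    [[a, b], [c, d]] (rows: groups; columns: set solution, other solution)."""
    n = a + b + c + d
    corrected = max(abs(a * d - b * c) - n / 2.0, 0.0)
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return n * corrected ** 2 / denominator


if __name__ == "__main__":
    # Assumed counts: 26 of 48 Ss with test-task instructions and 12 of 48
    # without gave a set solution on the first test anagram.
    print(round(yates_chi_square(26, 22, 12, 36), 2))
    # Assumed counts for the order-trained groups: 12 of 24 versus 3 of 24.
    print(round(yates_chi_square(12, 12, 3, 21), 2))
```

With these assumed counts the statistic comes to 7.36 and 6.21, matching the reported values, which suggests the counts are a reasonable reading of the percentages.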


Although test-task instructions pro- 
duced a significant increase in set 
solutions in the test series, training-task 
instructions did not have such an effect. 
A previous study by Maltzman and 
Morrisett (3) did obtain an increase in 
set solutions with task instructions given 
prior to the start of the training series. 
Rees and Israel (4) obtained comparable 
results in a similar experiment. How- 
ever, the class of anagrams used in these 
studies—letter orders, and nature words 
—typically produce greater sets than 
eating words. This implies, of course, 
that the effectiveness of a given kind 
of task instruction will vary with the 
classes of anagrams employed. A de- 
termination of the generality of the 
effects of task instructions on set solu- 
tions would also require the systematic 
variation of the strengths of training 
and task-induced sets with different 
classes of anagrams. 

Maltzman (1) has recently elaborated 
upon an interpretation of problem solv- 
ing which may incorporate the phe- 
nomena traditionally described as mental 
set and determining tendency. Briefly, 
the theory assumes that mental sets 
established through training produce a 
change in the compound habit-family
hierarchy by increasing the habit 
strength of a particular class of responses. 
Sets, or determining tendencies, estab- 
lished by task instructions produce a 
change in the compound habit-family
hierarchy by increasing the reaction
potential of a class of responses through 
arousal of their anticipatory goal re- 
sponse. Sets developed through training 
are based on the growth of habit strength, 
and therefore develop relatively slowly 
and have a degree of permanence. In 
contrast, sets induced by instructions 
are labile, are aroused rapidly, and need 
not persist. The results of the present 
experiment appear to be in accord with 
this theoretical interpretation. 


SUMMARY 


The present experiment was designed to 
determine whether task instructions to look for 
a particular kind of solution to anagrams would 
induce a disposition for these solutions when 
previous training and instructions were incongruent
with such a set. The results indicated
that a set may be established by task instructions
despite the incongruence of previous training
and instructions. A theoretical analysis of sets
induced by training and by instructions was
outlined.


REFERENCES 


1. Maltzman, I. Thinking: from a behavioristic
point of view. Psychol. Rev., 1955,
62, 275-286.

2. Maltzman, I., & Morrisett, L., Jr. Different
strengths of set in the solution of
anagrams. J. exp. Psychol., 1952, 44,
242-246.

3. Maltzman, I., & Morrisett, L., Jr. Effects
of task instructions on solution of different
classes of anagrams. J. exp. Psychol.,
1953, 45, 351-354.

4. Rees, H., & Israel, H. An investigation
of the establishment and operation of
mental sets. Psychol. Monogr., 1935, 46,
No. 6 (Whole No. 210).


(Received June 3, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


FURTHER GRADIENTS OF ERROR REINFORCEMENT 
FOLLOWING REPEATED REWARDED RESPONSES ¹


MELVIN H. MARX AND FELIX E. GOODSON 


University of Missouri 


In previously reported experimen- 
tation (5, 7) statistically significant 
spread-of-effect aftergradients were 
found in a Thorndikian (11) type of 
experimental situation with an ap- 
paratus designed to permit graded 
measurement of error strength and 
minimize systematic pre-experimental 
response sequences. Gradients were 
observed only after repeated rewarded 
responses. Bias in the response sequence,
such as may be due to the
operation of the guessing sequence 
(2, 3, 8, 9) or the probability-bias 
(10) factors, was controlled by using 
a control group of Ss who also re- 
peated, presumably by chance, the 
key response following which meas- 
urement of error strength was made. 

A modified Thorndikian hypothesis 
was proposed as a consequence of 
these results. It was designed to 
take into account the fact that error 
strengthening occurred only following
repetition of a rewarded response. It
stressed the necessity of the repetition
of the rewarded responses, as an
important component in the stimulus
complex present when the succeeding
response is made.

It was the primary purpose of the
present experiment to provide a
further test of this particular hypothesis.
This test was performed
with a basically similar design but
with a more naive type of S and an
apparatus permitting the collection
of group data.

¹ This research was supported in part by the
United States Air Force under Contract No.
AF33(038)-25631 monitored by the Perceptual
and Motor Skill Research Laboratory of the
Human Resources Research Center. Permission
is granted for reproduction, translation, publication,
use, and disposal by or for the United
States Government. Part of the research
described here was reported at the 1952 meetings
of the American Psychological Association at
Washington (abstracted in Amer. Psychologist,
1952, 7, 274-275).


METHOD
Subjects 


The Ss were 385 fifth- and sixth-grade public 
school students, used in groups of from 21 to 28.
The experiment was performed in the classrooms 
of the several schools involved. 


Apparatus 


A small multiple-choice learning box was
used. There were 27 rows of holes in the top,
with 12 holes in each row. A second board was
placed under this top. The second board was
solid except for slits in some rows. These were
the width of the holes above, and ran clear across
the board under all 12 of the holes. A list of 27
three-letter stimulus words appeared at the left
of a mimeographed record sheet covering the
arrangement of holes in the top of the box. As
E called out the stimulus word, S responded by
inserting a stylus furnished for the purpose into
one of the 12 dots on the paper to the right of
the stimulus word. These dots were directly
over the holes in the top board, so the stylus
always pierced the paper, even on incorrect
choices. When one of the slits in the second
board was under the row of holes, the stylus
went clear in to the hilt. The Ss were told that
this represented a correct choice. A hinged top
covered the apparatus when it was not being
used, and made it possible to change the record
sheets without revealing the underlying arrangement
to the S.

A major advantage of this apparatus was the 
elimination of a set central resting position to 
which S had to return following each response. 
This feature of the earlier apparatus (5, 7) had
been a complicating factor in the interpretation 
of the gradients. 


Experimental Procedure 


The Ss were told that the test was one of their
ability to learn and remember the correct hole
for as many of the words as possible. A number
of items in the instructions were used to stimulate
motivation and disguise the purpose of the
experiment.²

Experimental Ss were rewarded according to
one of three prearranged conditions on the first
two trials. They received rewards on either
(a) the 14th response, (b) the 5th and 23rd
responses, or (c) the 5th, 14th, and 23rd responses.
The positions of reward were determined
by the slits in the second board of the
apparatus. Different sets of boxes had slits
under different rows. One set had slits under
Row 14, another under 5 and 23, and a third
under 5, 14, and 23. There were no slits in the
boards for control Ss, who therefore received no
reward on either of these two trials. These
conditions are designated as 1-reward (N = 67),
2-reward (N = 57), 3-reward (N = 127), and
0-reward (N = 138) groups, respectively.

After the introductory instructions, Ss were 
shown a sample learning box in order to famil- 
iarize them with details of construction. This 
box had a random arrangement of correct holes. 
Each S was then given a small box containing 
only two holes through one of which the stylus 
passed up to the hilt and into one of which the 
stylus would go only } in. Each S punched 
these holes with his stylus in order to get the 
feel of the right and wrong signals. 

Trial 1.—The Ss were now given their full-sized
boxes, together with a 5 × 8-in. blank
card. They were told to punch once and only
once in each row, and to cover previous punches 
with the card as they proceeded down the page. 
The Ss were asked not to talk to each other at 
any time during the experiment. They were 
also told not to look beneath the sheet of paper 
in the box at any time. Proctors assisted E to 
insure that these cautions were observed. 

The Ss were assured that this was a very 
difficult task and that they would probably not 
get many right, if any, but that they should 
keep trying. They were advised that if they 
guessed randomly rather than by some sys- 
tematic method they would probably do better. 
The words were read off by E at the rate of one
every 8 sec. A trial was completed when Ss 
had punched once in each of the 27 rows as the 
27 stimulus words were read off. 

Trial 2.—The sheets used on Trial 1 were
removed by E's assistants and new sheets were
placed in the boxes. This was done with the
tops of the boxes down so that S could not see
into the boxes. The Ss were told that on this
trial they were to do two things: (a) repeat all
those dots which they had gotten right on the
first trial, and (b) try to get more new ones right.
Instructions about talking, looking into boxes,
etc. were repeated on this and the last two trials.

² The verbatim instructions for the experiment,
along with tabulations and analyses of
data summarized in the text, have been deposited
with the American Documentation Institute.
Order Document No. 4815, remitting $1.75 for
35-mm. microfilm or $2.50 for 6 by 8-in. photocopies.

Trial 3.—At the end of Trial 2 the boxes were 
interchanged among Ss. 0-reward and 3-reward 
Ss exchanged boxes, as did the 1- and 2-reward 
groups. These two additional trials with the 
change in correct responses were introduced to 
try to prevent discouragement among the 0- 
reward Ss. Trials 3 and 4 did not provide a 
good test of the hypothesis, because of inter- 
ference from earlier trials and possible response 
selection factors resulting from continuation past 
the second trial. Nevertheless, the data were
analyzed for whatever additional information 
they might give. The same stimulus terms were 
used on Trials 3 and 4, which did not tend to 
minimize interference. 

Trial 4.—Trial 4 was exactly like Trial 2. 
The Ss were told to repeat all those responses 
that they had gotten correct on Trial 3 and to 
try to make further correct responses. 


Treatment of Data 


The columns of dots punched by Ss on the 
record sheets were arbitrarily numbered from 
1 to 12 from left to right. Each S’s 27 responses 
on Trials 1 through 4 were recorded in terms of 
these numbers. Then the difference between 
the number of each response on Trial 1 and the
corresponding response number on Trial 2 was
computed. These difference scores, which
ranged from 0 to 11, provided a quantitative 
expression of the strength of the tendency to 
repeat a given response on successive trials. A 
low score thus indicated a strong tendency to 
repeat a response. This method is essentially 
the same as the difference-score measure pre- 
viously used (5, 7; cf. also 6). 
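The difference-score measure can be sketched as follows. Because the scores are said to range from 0 to 11 on a 12-column scale, the sketch assumes the absolute difference between the Trial 1 and Trial 2 column numbers; the function name and the short example protocol are hypothetical.

```python
def difference_scores(trial1, trial2, n_columns=12):
    """Return one difference score per row: the absolute difference between
    the column punched on Trial 1 and the column punched on Trial 2."""
    if len(trial1) != len(trial2):
        raise ValueError("both trials must cover the same rows")
    for column in list(trial1) + list(trial2):
        if not 1 <= column <= n_columns:
            raise ValueError("column numbers run from 1 to n_columns")
    return [abs(first - second) for first, second in zip(trial1, trial2)]


if __name__ == "__main__":
    # Hypothetical three-row excerpt: an exact repetition, a near miss,
    # and the largest possible shift on a 12-column scale.
    print(difference_scores([3, 7, 12], [3, 8, 1]))
```

A score of 0 marks an exact repetition, so lower scores correspond to a stronger tendency to repeat.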

Within the experimental groups Ss were 
assigned to subgroups on the basis of their 
difference scores at the rewarded positions. The 
subgroups thus formed were: (a) repeaters—
those Ss having a difference score of 0 at one of
the rewarded positions; (b) near-repeaters—
those Ss having a difference score of 1 at one of
the rewarded positions; and (c) nonrepeaters—
those Ss having a score of 2 or greater at one
of the rewarded positions. This breakdown was
planned on the basis of previous results (5, 7) 
which indicated the similarity of exact repeaters 
and near-repeaters, as here defined. 

Since some experimental groups received
more than one reward per trial it was, of course,
possible for an S to have more than one score of
0, 1, or 2 to 11 at key positions. Only the first
such score was included in the appropriate 
analysis. No S appears more than once in any 
of the groups to which a statistical test has been 
applied. However, it is possible for a portion 
of an S’s responses to appear in more than one 
analysis. That is to say, since no direct com- 
parisons have been made among repeaters, near- 
repeaters, and nonrepeaters, an S having a 0 
score at one rewarded position and a 1 or a 2-11
score at another may appear in two or three of 
these experimental subgroups. 
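The subgroup assignment described above can be sketched under stated assumptions: difference scores are absolute column differences, a score of 0, 1, or 2 to 11 defines the three subgroups, and for each subgroup only the first rewarded position with a qualifying score is kept. The function names and the example protocol are hypothetical.

```python
def classify(score):
    """Map a difference score to its subgroup."""
    if score == 0:
        return "repeater"
    if score == 1:
        return "near-repeater"
    if 2 <= score <= 11:
        return "nonrepeater"
    raise ValueError("difference scores range from 0 to 11")


def subgroup_entries(trial1, trial2, rewarded_rows):
    """Return {subgroup: first rewarded row (1-based) falling in that subgroup}.
    An S can enter more than one subgroup, but each subgroup only once."""
    entries = {}
    for row in sorted(rewarded_rows):
        score = abs(trial1[row - 1] - trial2[row - 1])
        entries.setdefault(classify(score), row)
    return entries


if __name__ == "__main__":
    # Hypothetical 27-row protocol for a 3-reward S (rows 5, 14, and 23).
    trial1 = [6] * 27
    trial2 = list(trial1)
    trial2[13] = 7    # row 14: shifted by one column
    trial2[22] = 11   # row 23: shifted by five columns
    print(subgroup_entries(trial1, trial2, (5, 14, 23)))
```

In this example the same S contributes to all three subgroups, once each, which is the situation the text allows for.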

In order to obtain homogeneity of variance, 
it was necessary to transform the scores by 
taking the common log of (Y + .5). 
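A minimal sketch of this transformation follows, assuming Y is the raw difference score (0 to 11) and that the common log is the base-10 logarithm. The added .5 keeps a score of 0 defined. The function name is hypothetical.

```python
import math


def transform(score):
    """Common (base-10) log of the raw difference score plus .5."""
    if score < 0:
        raise ValueError("difference scores are non-negative")
    return math.log10(score + 0.5)


if __name__ == "__main__":
    # Transformed values across the possible range of difference scores.
    for raw in range(12):
        print(raw, round(transform(raw), 4))
```

The transformed values run from about -.30 for an exact repetition to about 1.06 for the largest shift, compressing the upper end of the scale.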

Portions of some protocols were discarded in 
advance of analysis. The criteria for discarding
data were: (a) failure to make a response in one
of the critical positions, that is, in a rewarded-response
position or a nearby error position;
(b) making more than one response in any of the
critical positions; and (c) using obvious guessing 
systems such as are indicated by Dots 1, 2, 3, 4, 
or 4, 6, 8, 6, 4. This last precaution was es- 
pecially important in the present study because 
of the impracticability of emphasizing, with 
grade-school Ss, the customary instruction to 
“guess at random” (cf. 6). The present in- 
structions did urge Ss to use different numbers 
in guessing. 


RESULTS
Repeaters 


The method of obtaining experi- 
mental repeaters has been described. 
Control repeaters were obtained from 
the 0-reward group by taking the
difference scores for four steps after 
the first repeated response between 
the 5th and 23rd positions, in case 
there was more than one such repe- 
tition. In rare cases where no repeti- 
tion occurred, the S’s data could not 
be used in this analysis. Earlier 
results (5, 7) indicate that the serial 
position of the repeated response is 
not crucial in determining whether a 
gradient will occur. There were 102 
experimental and 112 control re- 
peaters. 

The experimental and control aftergradients
for Trials 1 and 2 are shown
in Fig. 1. It will be noted that a
regular and definite aftergradient
occurred in the experimental group
and that only a slight aftergradient
occurred in the control group. No
experimental foregradient was evident.

FIG. 1. Response-difference aftergradients
for experimental and control repeaters and
near-repeaters, Trials 1 and 2. [Curves labeled Exper. Repeaters, Exper. Near Repeaters, and Cont. Near Repeaters; abscissa: Error Positions.]

Transformed mean scores and vari- 
ances for the four aftergradient posi- 
tions are given in Table 1. For 
purpose of comparison this table also 
contains the data for near-repeaters 
and nonrepeaters. The trend test
described by Lindquist (4, Case 2) was
applied to the transformed-score
gradients. Only the experimental
aftergradient departed significantly
from a horizontal straight line (F
= 8.92, P = .001 for 3 and 303 df).

A direct comparison between ex- 
perimental and control aftergradients 
was made following Lindquist’s sug- 
gested method (4, Case 11). This 
analysis of variance indicated a sta- 
tistically significant difference in slope 
(F = 3.349, P = .01 for 3 and 636
df). In addition, the experimental
mean difference score was significantly
lower than the control group mean in
the first aftererror position (F = 6.46,
P = .05 for 1 and 212 df).

The repeaters’ data were also
analyzed for exact error repetitions,
which are the orthodox measures used
in the earlier spread-of-effect studies.
Cochran’s (1) chi-square test for
correlated variables was first applied
independently to the fore- and aftergradients.
The results of this test
confirmed those obtained by trend
tests based upon the difference scores.
Also, there were significantly more
error repetitions in the first aftererror
position for the experimental group.
A further measure of the tendency
to repeat errors was obtained by correlating
the responses for each error
position on Trial 1 with the responses
at that position on Trial 2. Table 2
presents the correlations and tests of
the reliability of the differences between
experimental and control r’s.


TABLE 1

Transformed Means for Aftergradients, Trials 1 and 2

[Table only partly legible. Rows: Experimental Repeaters, Control Repeaters, Experimental Near-repeaters, Control Near-repeaters, Experimental Nonrepeaters, Control Nonrepeaters. Columns: Error Positions, each with Mean and Variance. Legible entries for the first error position, in row order (mean, variance): .2845, .1417; .4311, .1385; .2612, .1357; .4488, .1593; .3517; .3961. The remaining entries cannot be assigned to cells.]


TABLE 2

Correlations Between Error Responses of Repeaters, Trials 1 and 2

[Table only partly legible. Column groups: Experimental, Control, Difference. Legible entries by row: .400, -.054, -.054, .454, 3.27*; .024, .135, .136, .112, 0.81; -.129, .039, .039, .168, 1.21; .099, .087, .087, .012, 0.09.]



The only significant difference in 
correlations between the two groups 
was at the first aftererror position, 
where the experimental r of .38 was 
itself significantly above zero at the 
.01 level. These results corroborate
those obtained by the difference-score 
method and indicate that the pre- 
ceding reward is a necessary condition 
for the repetition of errors. The 
correlation measure is especially im- 
portant as a supplement to the dif- 
ference-score measure because it is 
relatively insensitive to departures 
from normality, and is therefore fairly 
free of any distributional artifacts
arising as a result of reward.
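The comparison of the experimental and control correlations at the first aftererror position can be sketched as follows. The sketch assumes, without confirmation from the text, that the test was made on Fisher z-transformed correlations with a standard error of sqrt(1/(N1 - 3) + 1/(N2 - 3)). The control r of -.054 is read from the legible part of Table 2, and the Ns are the 102 experimental and 112 control repeaters. The function names are hypothetical.

```python
import math


def fisher_z(r):
    """Fisher's z transformation of a correlation coefficient."""
    return math.atanh(r)


def z_difference_test(r1, n1, r2, n2):
    """Return (difference in z, its standard error, their ratio) for two
    independent correlations. Assumed procedure, not confirmed by the text."""
    difference = fisher_z(r1) - fisher_z(r2)
    standard_error = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    return difference, standard_error, difference / standard_error


if __name__ == "__main__":
    # Experimental r = .38 (N = 102); control r = -.054 (N = 112).
    difference, standard_error, ratio = z_difference_test(0.38, 102, -0.054, 112)
    print(round(difference, 3), round(standard_error, 3), round(ratio, 2))
```

Under these assumptions the z difference is about .454 and the ratio about 3.27, which agrees with the legible entries in the first row of Table 2.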

In order to eliminate any group 
guessing sequences as alternative ex- 
planations of the major results a 
partial correlation technique was also 
applied. When the correlations be- 
tween the rewarded response position 
and the first aftererror position on 
each trial were partialled out, the 
correlation between the two first
aftererror positions remained significant.
The r was only slightly reduced,
to .34, still significant at the
.01 level. This means that the possible
concurrence on both trials of a
relationship between the location of
the rewarded response and the error
immediately following cannot be used
as an explanation of the strength of
this error.

A check was also made of the 
possibility that some side effect of 
reward operating on the immediately 
following afterresponses on both trials 
was responsible for the first aftererror 
strengthening. The most important
possibility here seemed to be that
following a reward S tended to restrict
the next response to the same
general area. The partial correlation
technique takes care of this possibility
in part, but a more direct test
was also made. The experimental
and control aftergradients were
broken down according to the difference
between the repeated response
and the first aftererror on Trial 1.
Examination of these data indicated
that (a) experimental and control Ss
showed about the same tendency to
stay close to the repeated response,
and (b) the first aftererror repetition
tendency was greater in all experimental
classifications except the last
one. These facts suggest that artificial
response restriction following
reward was not an important gradient-producing
factor.

The experimental data in terms of
difference-score gradients around repeated
rewarded responses were
analyzed separately for each of the
three serial positions, 5, 14, and 23.
All repeated rewarded responses at
each position were used for this
analysis. Each of the marked aftergradients
was significant at beyond
the .01 level, as shown in Table 3.
These results indicate that the significantly
positive results based upon
first occurrence of the repeated rewarded
response and compared with
repeated nonrewarded controls were
not a function of serial position. The
foregradients were irregular and did
not justify statistical test.


Near-Repeaters 


Gradients of error strength around 
near-repetitions (that is, intertrial 
variations in the correct response of 
only one unit) were also analyzed on 
the assumption that most such cases 
probably represented very close ap- 
proximations to successful repetition. 
A number of such near-misses would 
be expected in view of the lack of 
numerical or other definition on the 
response scale. Experimental near-repeaters
were selected in the same
manner as experimental repeaters.
Difference scores were recorded for
four steps before and after the first
difference score of 1 occurring in a
rewarded-response position. Control
near-repeaters were obtained by first
locating all control Ss with difference
scores of 1 at Positions 5, 14, or 23,
the equivalents of the rewarded
positions in the experimental groups.
The first such difference score was
used, except for those Ss who happened
to have three 1 scores, in which
cases the selection was made at
random. There were 52 experimental
and 51 control near-repeaters thus
obtained.


TABLE 3

Aftergradients of All Repeaters at Each Rewarded Position, Trials 1 and 2

[Table body not legible. Columns: Reinforced Serial Position; Mean Difference Score by Error Position; Trend Test*.]

* Based on transformed scores

The aftergradients for the near- 
repeaters are shown in Fig. 1. Again 
the experimental aftergradient was 
more marked than the control. 
Trend tests were made of all gradients, 
just as they were for repeaters. 
Again, only the experimental after- 
gradient was significantly different 
from a straight horizontal line (F
= 6.15, P = .01 for 3 and 153 df).

The over-all analyses of variance 
on transformed scores yielded a sig- 
nificant difference between the experi- 
mental and control aftergradients 
(F = 3.42, P = .02 for 3 and 303 df). 
The experimental-control difference 
between the mean transformed dif- 
ference scores at the first aftererror 
position was also significant (F 
= 6.01, P = .02 for 1 and 101 df). 


Nonrepeaters 


These Ss were selected in the same 
manner as experimental and control 
repeaters, i.e., the first score of 2 
through 11 that occurred in response 
Position 5, 14, or 23 was used to 
represent each S. Raw-score gradi- 
ents for first to fourth errors were as 
follows: 2.68, 3.19, 3.60, and 3.57 
for the experimental Ss; 3.02, 3.64, 
3.87, and 3.45 for the control Ss. 

Transformed aftergradient data for 
experimental and control nonrepeaters 
are given in Table 1. Although 
relatively weak compared with ex- 
perimental repeaters’ and near-re- 
peaters’ gradients, both of these were 
nonetheless statistically significant 
(experimental F = 5.58, P = .01 for


MELVIN H. MARX AND FELIX E, GOODSON 


3 and 384 df; control F = 3.32,
P = .05 for 3 and 375 df). A direct
comparison of the two gradients
showed the experimental to be
stronger at the .06 level of significance
(F = 2.54, for 3 and 756 df; 2.62
required for .05 for 3 and 500 df).
The intertrial r for raw first aftererror
responses was .15 for the experimental
Ss, just short of significance at the .05
level (r of .17 required for N of 125).
The r for the control Ss was .01.


Trials 3 and 4 


The analysis of the data provided by Trials 
3 and 4 was conducted in exactly the same 
manner as that of the data from Trials 1 and 2.
The major analyses are here briefly reported. 

Trend tests were made of the repeaters’ 
gradients from 115 experimental and 67 control 
Ss. Only the experimental aftergradient de- 
parted significantly from a straight line (F
= 9.66, P = .01 for 3 and 342 df).

Direct comparisons of the slopes and means 
of the experimental and control groups revealed 
no significant differences. Nor was the mean 
difference in the difference scores at the first 
aftererror position significant (F = 2.57; need 
F = 3.89 for P = .05). 

The correlation between responses on Trials 
3 and 4 at the first aftererror position for the 
experimental Ss was .27 (P = .01), and for the 
control was not significant (r = .16). The 
difference between these correlations was not 
significant. 

The partial correlation technique was applied 
to the experimental group correlation, partialling 
out the correlations between the location of the 
rewarded response and of first aftererror re- 
sponses. The resulting intertrial correlation 
was .26, still significant at the .01 level. 
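
The partialling step can be illustrated with the standard first-order partial correlation formula, r12.3 = (r12 - r13 r23) / sqrt((1 - r13^2)(1 - r23^2)). The Python sketch below is a minimal illustration under the assumption that a single variable is held constant; the function name and the example coefficients are hypothetical, since the report gives only the resulting value of .26 and not the component correlations.

```python
import math


def partial_correlation(r12: float, r13: float, r23: float) -> float:
    """First-order partial correlation of variables 1 and 2, holding variable 3 constant."""
    denominator = math.sqrt((1.0 - r13 ** 2) * (1.0 - r23 ** 2))
    if denominator == 0.0:
        raise ValueError("undefined when r13 or r23 equals +1 or -1")
    return (r12 - r13 * r23) / denominator


# Hypothetical coefficients, chosen only for illustration (not taken from the paper).
example = partial_correlation(0.60, 0.50, 0.50)
print(round(example, 3))
```

When the controlled variable is nearly uncorrelated with the other two, the partial correlation stays close to the original coefficient, which is the pattern reported here (.27 before partialling, .26 after).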

Near-repeater gradients were based upon 57 
experimental and 35 control Ss. None of the 
gradients was significantly different from a 
straight line. The experimental aftergradient,
however, was significantly stronger than the
control in terms of level (F = 6.33, P = .02 for
1 and 270 df). The difference between the
difference-score means at the first aftererror
position also significantly favored the experimental
group (F = 12.77, P = .01 for 1 and 90
df).

Nonrepeater gradients were based upon 108
experimental and 76 control Ss. Only the experimental
aftergradient was statistically significant
(F = 3.62, P = .02 for 3 and 321 df).





GRADIENTS OF ERROR REINFORCEMENT 


It did not differ significantly, however, from the 
control gradient. 


Discussion 


The present results offer confirmation 
of those previously obtained (5, 7) with a
different type of S and apparatus. Trials
1 and 2, which constituted a clear test, 
gave the strongest confirmation. It is 
especially important to note that direct 
comparisons of the experimental and 
control data yielded statistically sig- 
nificant differences. Less clear-cut sup- 
port was given by Trials 3 and 4, where 
there were contaminating factors. 

The finding of significant aftergradi- 
ents following nonrepeated key responses 
requires comment. This result was un- 
expected on the basis of the previous 
results. The fact that experimental 
aftergradients following nonrepeated re- 
warded responses occurred both on 
Trials 1-2 and 3-4 offers support for 
Thorndike’s original “spread” hypothesis 
(11). This is particularly true since the
experimental aftergradient on Trials 1-2
was found to be stronger than the control
at the .06 level. If verified, this result
would eliminate the need for the modified
Thorndikian hypothesis previously advanced.


The fact that even the control aftergradient 
following nonrepeated key responses was sig- 
nificant on Trials 1-2, suggested that perhaps 
there was something unusual about Positions 5, 
14, and 23, since this result would not be predicted 
on the basis of the guessing-sequence or similar 
response-bias hypothesis which involves repeated 
responses as anchors. A special check of the 
control nonrepeater aftergradient following 
intermediate response Positions 9 and 18 in- 
directly supported this suggestion, since no 
tendency towards a gradient was observed (raw 
mean difference scores of 4.29, 3.61, 3.70, and 
3.44 for first through fourth after position). A 
second special check was made on control Ss 
who had repeated errors in the key positions 5, 
14, and 23. Only 44 Ss were found who had 
repeated at one of these positions, and their 
aftergradient was essentially similar to that 
observed for the larger control subgroup based 
upon all response positions (3.00, 2.89, 3.50, and 
3.45). This gradient was not significant (F 
< 1.00), nor was the experimental repeater 
aftergradient significantly stronger than it 




(F = 1.32, P > .05 for 3 and 432 df). However,
it should also be noted that the control near- 
repeaters, based upon response positions 5, 14, 
and 23, failed to provide a significant after- 
gradient. These considerations suggest that 
this single significant control aftergradient found 
for nonrepeaters may most plausibly be regarded 
as a sampling variation. 


This conclusion is fortified by the failure to find any intertrial
correlation (r = .01) for the first control aftererror position. But
such considerations leave unresolved the problem of the two significant
experimental nonrepeater aftergradients, which are equally embarrassing
to the modified Thorndikian (5, 7) and the guessing-sequence (3)
hypotheses since each depends upon repetitions of key responses. The
present results at least suggest that this problem be carefully
considered in subsequent research and that the necessity of repetition
of key response not be arbitrarily assumed without checking the data.

With the qualifications imposed by the above comments, the results are
interpreted as offering support for the modified Thorndikian
spread-of-effect hypothesis previously advanced. The necessity for
reward is indicated by the failure to find statistically significant
control gradients following repeated nonrewarded responses. This
control and the use of the partial correlation technique in the present
study eliminate the guessing-sequence and the probability-bias
hypotheses as possible explanations of the positive results.

The finding of a significant correlation for only the first aftererror
raises a question as to the extent of the spread of reward. Whether or
not the aftergradient extends beyond the first step is a question which
must be decided by further experimentation.


SUMMARY 


This study involved the analysis of data 
provided by 385 fifth- and sixth-grade public- 
school children who were given four trials on an 
orthodox Thorndikian type of multiple-choice 
learning task with prearranged conditions of 
reward. In general it was found that the errors 
made immediately following a repeated rewarded 







response tended to be significantly stronger than 
comparable errors made immediately following 
a repeated but nonrewarded response. The 
guessing-sequence factor and similar nonrandom 
biasing of the response sequences were controlled 
by use of repeated nonrewarded responses as 
well as by application of a partial-correlation 
technique. Some tendencies toward after- 
gradients were also found following nonrepeated 
responses, however, constituting support for the 
original Thorndikian spread-of-effect hypothesis. 
With this exception, the results are interpreted 
as confirmation of similar ones previously ob- 
tained with a different apparatus and a less 
naive type of S, and as offering support for the 
modified Thorndikian spread-of-effect hypothesis 
previously advanced. 


REFERENCES 


1. Cochran, W. G. The comparison of percentages in matched samples.
Biometrika, 1950, 41, 256-266.

2. Fagan, C. A., & North, A. J. A verification of the guessing sequence
hypothesis about spread of effect. J. exp. Psychol., 1951, 41, 349-351.

3. Jenkins, W. O., & Sheffield, F. D. Rehearsal and guessing habits as
sources of the “spread of effect.” J. exp. Psychol., 1946, 36, 316-330.

4. Lindquist, E. F. Goodness of fit of trend curves and significance of
trend differences. Psychometrika, 1947, 12, 65-78.

5. Marx, M. H. Error-reinforcement gradients with bias in the response
series controlled. Amer. Psychologist, 1951, 6, 293. (Abstract)

6. Marx, M. H. Spread of effect: a critical review. Genet. psychol.
Monogr., 1956, 53, 119-186.

7. Marx, M. H. Gradients of error reinforcement in a serial
perceptual-motor task. Psychol. Monogr., in press.

8. Sheffield, F. D. “Spread of effect” without reward or learning.
J. exp. Psychol., 1949, 39, 575-579.

9. Sheffield, F. D., & Jenkins, W. O. Level of repetition in the
“spread of effect.” J. exp. Psychol., 1942, 44, 101-107.

10. Smith, M. H. Spread of effect is the spurious result of non-random
response tendencies. J. exp. Psychol., 1949, 39, 355-368.

11. Thorndike, E. L. An experimental study of rewards. Teach. Coll.
Contr. Educ., 1933, No. 580.


(Received April 18, 1955) 





Journal of Experimental Psychology 
Vol. 51, No. 6, 1956 


SAMPLED-DATA TRACKING: SAMPLING OF THE
OPERATOR’S OUTPUT¹

CORWIN A. BENNETT

Human Factors Group, International Business Machines Corporation

In the usual display-control situation under normal operating
conditions, the operator closes a control loop by tracking a target.
Thus, in a compensatory tracking task, when the operator sees a
discrepancy between a movable target and fixed cross hairs, he displaces
his control in order to bring the target back onto the cross hairs. The
operator’s output or response is then continuously sensed and utilized
by analog equipment. However, when such an analog device is replaced by
a digital computer obtaining solutions at finite time intervals, the
operator is placed in a unique situation. The digital computer can
accept the output of the operator as an input only at finite time
intervals. Accordingly, the computer will present solutions as an
information input or display to the operator only at such intervals.
Thus, the operator, acting near-continuously on an analog control device
which is sampled at discrete time intervals by the digital computing
equipment, sees the results of his activities as discrete events on the
display, smoothed only by whatever inertia may be present in the control
mechanism.² It is pertinent to ask what effects such a digitalization
will have on the operator’s tracking achievements. Particularly, one
might anticipate a degradation of tracking performance under such
sampled-data conditions.

Previous experiments (1, 2, 4), in which sampling or intermittency was
present in the control loop, included sampling of the input to the
operator rather than his output. From the operator’s standpoint,
however, there is a difference between sampling his input and sampling
his output. In sampling his input, the occurrence and results of
sampling are immediately apparent. In sampling his output, sampling
effects are noticeable only after the operator’s response is fed back to
him through the computer and display.³ Figure 1 shows the general nature
of the control loop for the two types of sampling conditions.⁴

Fig. 1. Control loop block diagrams: Sampling of operator’s input and
output. (Upper panel, operator’s input sampled: display, operator,
control, analog computer, display drive servos, external disturbances.
Lower panel, operator’s output sampled: display, operator, control,
analog-to-digital converter, digital computer, digital-to-analog
converter, display drive servos, external disturbances.)

The primary variable entailed by control-loop sampling is the sampling
rate. Battig et al. (1) studied rates between .4 cps and 16.1 cps. An
optimal rate between 6 and 16 cps was found with large decrements in
time on target at lower rates. This is contrary to the previously
implied expectation that with an infinite sampling rate (continuous),
performance would be optimum, and, as sampling rate is reduced,
performance would be degraded. This experiment was repeated (2) with
control of previously uncontrolled variables. No optimum rate was found,
but a linear increase in performance was observed for rates from 2 to
16 cps.

Katz and Spragg (4) studied mean tracking error as a function of
intermittency as related to operator’s input sampling. They describe
their findings as showing a positive logarithmic relationship between
tracking accuracy and illumination frequency. In another experiment
described in the same report, a 4-cps rate and a continuous condition
were investigated. The continuous condition proved to be superior.

¹ The writer would like to acknowledge the contributions to various
phases of this work of M. W. Olson, M. M. Dickinson, R. M. Lewis, and
J. S. Renfro of IBM’s Airborne Computer Laboratory; W. J. Merrill and
P. M. Seever of the Scientific Computation Laboratory; and W. L. Jenkins
of Lehigh University.

² This smoothing is such that, without inertia, a moving target appears
as a succession of dots on the display. Thus, the target is stationary
except at sampling time when it jumps to another position. With
sufficient inertia the moving target appears as a continuous trace.
Further experimentation is being conducted on the effects of amount of
inertia on sampled-data tracking performance.

³ These delays due to sampling may vary from one to two sampling periods
in length. Thus, at a 2-cps rate, the delay between operator response
and the displayed effect is at least ½ sec. and no more than 1 sec. In
addition to sampling delays, there are exponential delays due to s[...]
in the present experimentation such that after .25 sec., approximately
two-thirds of the total target displacement will have taken place.

⁴ Sampling might well take place at both the operator’s input (display)
and output (control). Studies are currently being performed on such a
control system.
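
The footnote on sampling delays states that the delay introduced by output sampling runs from one to two sampling periods. A minimal Python sketch of that arithmetic follows. The function name is hypothetical, and the only assumption is that one sampling period equals the reciprocal of the sampling rate.

```python
from typing import Tuple


def sampling_delay_bounds(sampling_rate_cps: float) -> Tuple[float, float]:
    """Return (minimum, maximum) sampling delay in seconds: one to two sampling periods."""
    if sampling_rate_cps <= 0:
        raise ValueError("sampling rate must be positive")
    period = 1.0 / sampling_rate_cps
    return period, 2.0 * period


# Delay range at several sampling rates (in cps).
for rate in (2, 3, 4, 6):
    low, high = sampling_delay_bounds(rate)
    print(f"{rate} cps: {low:.2f} to {high:.2f} sec")
```

At 2 cps this gives a range of 0.50 to 1.00 sec., which agrees with the example given in the footnote.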

In addition to the effects of sam- 
pling in the control loop, one might 
wish to determine the effects of 
introducing constant delay periods 
between the operator’s response and 
effects of the response as presented 
on the display. Such delays have
been called “transmission-type
delays” and were demonstrated by 
Warrick (8) to result in degraded 





SAMPLE-DATA TRACKING 


tracking performance in one type of 
tracking situation. Rockway (6) has 
reviewed studies of the effects of this 
and other types of delays on tracking 
performance. 

A further problem common to all control devices is that of
“control/display ratio,” or “sensitivity” of the control.⁵
Control/display ratio may be defined as the magnitude of displacement
(or one of its derivatives) on the display, produced by a given control
displacement. The sensitivity might be expected to be of especial
importance in the sampled-data situation.

⁵ Some writers prefer “display/control ratio” or “magnitude of
hand-control displacement required to effect a given rate of movement on
the display.” The author follows Chapanis et al. (3, pp. 325, 328), and
more recently, Rockway (6). Rockway summarizes investigation of
control/display ratio, but unfortunately these investigations have no
necessary bearing on the present situation since they have been
concerned with position (displacement) control rather than rate control.

One further variable was introduced into the experimentation, that of
tracking task. One type of tracking task arises when the target is
instantaneously displaced (by an external disturbance) from the cross
hairs (“discretely displaced target”). The task then is to return the
target as quickly as possible to the cross hairs. Another tracking task
arises when the target is slowly but continually displaced at a given
rate from the cross hairs (“constantly displaced target”). The
operator’s task is then to set a rate into the system equal in magnitude
and opposite in direction to the disturbance, and thus maintain the
target on the cross hairs.

It should be pointed out that, with the present instrumentation, a given
displacement of the operator’s control moves the target at a given rate.
Therefore, the first task requires the operator to move the discretely
displaced target at some rate, releasing the deadman switch at the
appropriate time to stop the target on the cross hairs. The second task
requires the operator to try to produce a rate equal and opposite to the
disturbing rate. By the time such a rate is discovered, however, the
target has been displaced from the cross hairs. Thus, the operator must
simultaneously try to remove both rate and position errors.
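
The rate-control arrangement just described, combined with sampling of the operator's output, can be sketched as a small simulation. The Python sketch below is illustrative only: the proportional operator model, the gain, the sensitivity value, and all names are assumptions introduced here and are not part of the reported apparatus. The control deflection is read only at sampling instants and held until the next sample, and the target moves at a rate proportional to the held deflection.

```python
def simulate_sampled_rate_control(initial_error_in, sensitivity, operator_gain,
                                  sampling_rate_cps, duration_sec, dt=0.001):
    """Return the target error (in.) after duration_sec of sampled rate control.

    sensitivity: target rate (in./sec) per unit of control deflection (assumed units).
    operator_gain: control deflection per inch of displayed error (hypothetical operator).
    """
    steps_per_sample = max(1, round(1.0 / (sampling_rate_cps * dt)))
    total_steps = round(duration_sec / dt)
    error = initial_error_in
    held_deflection = 0.0
    for step in range(total_steps):
        if step % steps_per_sample == 0:
            # The operator's output is sampled and held until the next sample.
            held_deflection = -operator_gain * error
        # Rate control: the held deflection sets the target's rate of movement.
        error += sensitivity * held_deflection * dt
    return error


# Same hypothetical sensitivity at two sampling rates, 1-in. initial displacement.
results = {}
for rate in (6, 2):
    results[rate] = simulate_sampled_rate_control(
        1.0, sensitivity=5.0, operator_gain=1.0,
        sampling_rate_cps=rate, duration_sec=5.0)
    print(f"{rate} cps: final error {results[rate]:+.3f} in.")
```

With these assumed values the error dies out at 6 cps but grows at 2 cps, because each held correction is applied for a full sampling period and overshoots when the period is long. This is only a toy model of the interaction between sampling rate and sensitivity, not a description of how the Ss actually tracked.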

In summary, the present problems in regard to sampled-data tracking are
the determination of: (a) tracking performance as a function of sampling
rate and hand-control sensitivity for discretely displaced targets;
(b) tracking performance as a function of sampling rate for delay and
no-delay conditions for discretely displaced targets; and (c) tracking
performance as a function of sampling rate for constantly displaced
targets. Three experiments were carried out to determine these three
functions.


Method


Apparatus.—The simulation of the digital 
control loop in which S’s output is sampled is 
shown in Fig. 2. The digital computer was 
simulated by an analog computer (a modified 
‘“integral-differential analyzer’) and relays. In 
this simulation sampling “on time,” during which signals were
“accepted,” was approximately 15 msec. Sampling “off time” was the
reciprocal of sampling rate (in cps) in seconds minus “on time.”

The task was a non-aided compensatory tracking situation. The hand
control used was a spring-centered rate-controlled joy-stick (8 in.
radius), equipped with a dead-man switch. Forces required for the
displacement of the control were .5 lb. initially and 2.5 lb. for total
displacement.

The display was a yg-in. spot presented on a DuMont 5-in. laboratory
cathode-ray oscilloscope (Type 804-A) with a P-1 phosphor. The scope was
provided with cross hairs and three concentric markers, ½, 1, and 2 in.
in radius. The equipment was located in a semi-darkened room. The Ss
were allowed to set their viewing distance. These distances varied
between 12 and 16 in.

Sensitivity changes were accomplished by potentiometer adjustments in
the analog computer. Since the external disturbances entered the loop
subsequent to these potentiometers, changes in gain sensitivity had no
effect on what S saw in the way of disturbances.

Target position was plotted as a function of time on a Brush Recorder.
Measurements of the criteria (recovery time and 50% limits) were made
manually from these records.

Fig. 2. Simulated digital control system. (Blocks shown: hand control,
operator, oscilloscope display, control panel, sampling rate generator,
sampling switch and relays, storage condenser, analog computer,
disturbance generator, disturbance indicator, recorder amplifier, Brush
recorder.)

Subjects.—The Ss were an incidental sample of engineers and technicians.
The same seven Ss were used in Exp. 1 and 2. The seven Ss used in Exp. 3
had previously taken part in either the first two studies or in a
similar study conducted previously. Those Ss unacquainted with the
purposes of the experimentation were familiarized with them at the
beginning of the study.

Experiment 1.—Experiment 1 was designed to determine the relationship
between sampling rate, hand-control sensitivity, and tracking
performance for discretely displaced targets.

The choice of sampling rates to be studied was based on the results of a
pilot study. In that study, a 2-cps rate seemed a limiting case beyond
which tracking was unreasonably poor. Other sampling rates, chosen in
the region at which satisfactory tracking was expected, were 3, 4, and
6 cps. A continuous (nonsampled, analog) condition was included as a
reference situation.
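
The sampling timing given under Apparatus can be worked through for the rates chosen for Exp. 1. The Python sketch below assumes the stated on time of approximately 15 msec. and computes off time as the reciprocal of sampling rate minus on time. The function and constant names are hypothetical.

```python
ON_TIME_SEC = 0.015  # approximate "on time" reported for the simulation


def off_time_sec(sampling_rate_cps: float, on_time_sec: float = ON_TIME_SEC) -> float:
    """Off time = sampling period (1 / rate) minus on time, in seconds."""
    period = 1.0 / sampling_rate_cps
    if on_time_sec >= period:
        raise ValueError("on time must be shorter than the sampling period")
    return period - on_time_sec


# Sampling rates (cps) studied in Exp. 1.
for rate in (2, 3, 4, 6):
    print(f"{rate} cps: period {1.0 / rate:.3f} sec, off time {off_time_sec(rate):.3f} sec")
```

Even at the highest rate used in Exp. 1 (6 cps), the on time is under a tenth of the sampling period, so the control output is ignored for most of each cycle.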

Experience with sampled-data tracking prior to the experimentation
indicated that as sampling rate decreased, the optimum hand-control
sensitivity decreased; that is to say, for lower sampling rates, it was
desirable to have a given control movement effect less display movement.
For any given sampling rate, it was expected that an optimum sensitivity
would exist, and either a higher or a lower sensitivity would result in
poorer tracking. Three sensitivities were chosen so that an apparently
optimum, a high but reasonable, and a low but reasonable sensitivity for
each sampling rate would be used.⁶ Specific sensitivities used for each
sampling rate are shown in Table 1.

⁶ Originally it was intended that a wide enough range of sensitivities
to include the optimal sensitivity for each sampling rate (studied)
would be included in a complete factorial design. For reasons of economy
of time this was not done.





TABLE 1

Sensitivities and Sampling Rates Used in Exp. 1

[Column headings: Sampling Rate (cps); Sensitivity (Inches/Second/20°
Hand-Control Deflection), with Optimal and High columns legible. Rows
include a Continuous condition. The table entries are not recoverable.]


“Recovery time” was used as the criterion of tracking performance.
Recovery time is defined as the time from the introduction of the
disturbance until S brings the target within the ½-in. circle and
maintains it there.
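
The recovery-time criterion can be expressed as a short procedure over a sampled record of target position. The following Python sketch is a hypothetical illustration (in the study the measurements were made manually from the Brush Recorder records). The function name, the data layout, and the default circle radius of 0.5 in. are assumptions.

```python
import math
from typing import Optional, Sequence, Tuple


def recovery_time(samples: Sequence[Tuple[float, float, float]],
                  radius_in: float = 0.5) -> Optional[float]:
    """Time from disturbance onset until the target enters the circle and stays inside.

    samples: (t_sec, x_in, y_in) tuples ordered in time, with the disturbance
    introduced at the first sample's time. Returns None if the target never
    settles inside the circle within the record.
    """
    if not samples:
        return None
    onset = samples[0][0]
    entry_time = None
    for t, x, y in samples:
        if math.hypot(x, y) <= radius_in:
            if entry_time is None:
                entry_time = t      # candidate entry into the circle
        else:
            entry_time = None       # target left the circle; start over
    return None if entry_time is None else entry_time - onset


# Hypothetical record: the target overshoots once before settling.
record = [(0.0, 1.0, 0.0), (0.5, 0.4, 0.0), (1.0, -0.7, 0.0),
          (1.5, -0.3, 0.0), (2.0, 0.1, 0.0)]
print(recovery_time(record))
```

The first entry into the circle at 0.5 sec. is discarded because the target leaves again, so only the final entry counts, which matches the requirement that S maintain the target within the circle.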

Five series of eight disturbances each were used as the tracking tasks.
A disturbance would instantaneously drive the target to the 1-in. circle
at one of the cardinal headings, or to the 2-in. circle at the cardinal
or intercardinal headings. Each series was made up of four disturbances
of each magnitude, with heading and magnitude randomized.

Tracking was performed on 12 successive work days, with 7 complete
replications of the 15 conditions for each S. The first three
replications served as training, during which the conditions were
presented in sequence to ease the transition from one condition to the
next. On the last four replications conditions were presented in random
order to eliminate any systematic effects of change from condition to
condition. Data from only these last four replications were used in the
analysis of results.
On any given day, S operated under about 9 of the 15 different
conditions of sampling rate and sensitivity. The Ss were instructed to
bring the target within the ½-in. circle as quickly as possible. Before
any given condition, S was given a 10-sec. practice session, during
which he was allowed to move the target at will and thus get the “feel”
of the control condition (as an attempt at eliminating
order-of-conditions effects). Subsequent to the practice period, a ready
signal was given and the trial started. The disturbances were presented
at 12-sec. intervals during the training sessions, and at 10-sec.
intervals thereafter.

Experiment 2.—The second experiment immediately followed the first. The
same Ss, disturbances, and criterion of tracking performance were used.
This study was designed to determine the effects of delay on tracking
performance as a function of sampling rate.

In this experiment, a constant delay period equal to the sampling
interval (e.g., .35 sec. for 3-cps sampling) was introduced between S’s
response and indications of the response on the display. Based on the
results of Exp. 1, the following sampling rates were chosen: 3, 4, 5, 6,
and 8 cps. Each of these sampling rates was studied with and without the
additional delay period.

Indications were that, as optimum sensitivity decreases with sampling
rate, it also decreases with delay. One sensitivity was chosen for each
sampling rate to be optimum for the delay condition (and, thus, less
than optimum for the no-delay condition). Thus, any decrement in
performance using the delay conditions could not be attributed to
sensitivity. These sensitivities were 3, [illegible], 2, and 2.5 in. per
sec. per 20° of hand-control displacement.

The training period again consisted of three replications, but for the
delay conditions only, since Ss already had experience under the
no-delay conditions. In the final four replications, Ss performed under
the various delay, no-delay conditions in random order. These
replications were used in the analysis. The seven replications required
seven working days.
Experiment 3.—The third experiment was designed to determine the
relationship between sampling rate and tracking performance for targets
which are slowly but constantly displaced from the cross hairs.

Preliminary experience with this tracking task indicated two things in
contrast to the previous tracking situation: (a) little if any learning
would take place; and (b) lower sampling rates and sensitivities would
yield tracking performance on a par with the continuous or [...]
condition.

Five sampling conditions were used [...] a continuous [...]. [...] used
for all conditions [...] 20° of hand-control displacement.

The “50% limits” was used as [...] of performance. The 50% limits [...]
as the distance in both horizontal and [...] dimensions on the [...]
within which [...] maintain 50% of [...] rate tracking of [...] constant
rate, the error [...] more or less about the zero-error line rather than
[...] continuously varying [...] discontinuous [...] calibrated [...].
Recovery time was thus not as meaningful a measure for such tracking.

A new set of 20 disturbances was [...], each being 2 in. per sec. in
magnitude. Disturbances were in the eight cardinal and intercardinal
directions with equal frequencies. [...] series was constructed so that
each pair of disturbances was separated by an interval of







disturbance. Each disturbance changed di- 
rection 45° from the previous disturbance. Two 
sessions, on successive days, were given the Ss. 
Each session consisted of performance under 
each of the five sampling conditions (again, in 
random order) for 2 min., during which time four 
15-sec. disturbances separated by four 15-sec. 
intervals were presented. 


Results


Analysis of variance.—An analysis 
of variance was carried out on the 
data from each of the three experi- 
ments. Tables 2, 3, and 4 show the 
results of these analyses for main 
effects and all significant interactions. 




Estimates of the component of vari- 
ance attributable to each factor 
(Column 6) and the percentage of the 
sum of the variance components 
attributable to each factor (Column 
7) were computed for each experiment.⁷

In Exp. 1 (Table 2) seven of the 
factors affect tracking as indicated 


7 Discussion and computation of components 
of variance may be found in Mood (5, pp. 342 ff.). 
The percentage of the sum of components of 
variance (Column 7) is simply the individual 
component divided by the sum of Column 6, 
quotient times 100. 
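
The percentage computation described in this footnote can be checked against the estimated components that remain legible in Table 2. The Python sketch below is illustrative. The function name is hypothetical, and the components are labeled by position only, because the row alignment of the table is not assumed here.

```python
def percent_of_components(components):
    """Each component divided by the sum of all components, times 100."""
    total = sum(components)
    return [100.0 * c / total for c in components]


# Estimated components of variance as they appear, in order, in Table 2 (Exp. 1).
components = [0.732, 1.026, 0.033, 0.018, 0.052, 0.039, 0.070]
percents = percent_of_components(components)
for index, (c, p) in enumerate(zip(components, percents), start=1):
    print(f"component {index}: {c:.3f} -> {p:.1f}%")
print(f"sum of components: {sum(components):.3f}")
```

The seven values sum to 1.970, the total printed in the table, and the two largest together come to about 89% of that sum.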


TABLE 2

Summary of Analysis of Variance: Exp. 1

[Column headings: Source; df; Mean Square; F; Estimated Component of
Variance; % Sum of Components of Variance. Row alignment is not
recoverable. The legible entries are listed below in scan order.]

Sources: Subjects (A); Sampling rate (B); Sensitivity (C); Sessions (D);
A × B
df: 4, 2, 3, 24, 12, 18, 8, 4
Mean Square: 367.00, 705.00, 30.40, 7.69, 6.44, 6.15, 954, 12.00, 3.28,
2.72
F: 19.47**
Estimated Component of Variance: 0.732, 1.026, 0.033, 0.018, 0.052,
0.039, 0.070; total 1.970

* Significant at .05 level.
** Significant at .01 level.
† Denominator in F test.
‡ For tests of effects involving several significant components, see
Walker and Lev (7, pp. 273-3)


TABLE 3

Summary of Analysis of Variance: Exp. 2

Source                  df      F          Estimated Component
                                           of Variance
Subjects (A)                    70.70**    0.508
Sampling rates (B)              14.08*     0.322
Delay-no delay (C)              26.10**    0.247
Sessions (D)                     3.58*     0.011
B × C                            4.73**    0.039
A × B × C × D           72       1.35*     0.076
Pooled residual†        2149
Total                   2239

[The Mean Square and % Sum of Components of Variance columns and the df
entries for the first five rows are not recoverable. F values and
components are assigned to rows in scan order.]

* Significant at .05 level.
** Significant at .01 level.
† Denominator in F test.













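TABLE 4

Summary of Analysis of Variance: Exp. 3

[Column headings: Source; df; Mean Square; F; Estimated Component of
Variance; % Sum of Components of Variance. Sources: Subjects (A);
Sampling rates (B); Sessions (C); A × C; Pooled residual†; Total (df
69). The remaining table entries are not recoverable.]

* Significant at .05 level.
** Significant at .01 level.
† Denominator in F test.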


by the F tests. Two of these factors, 
Ss and Sampling, accounted for nearly 
90% of the explained variance. The 
remaining factors accounted for only 
1% to 3% each. These percentages 
of explained variance indicate the 
relative emphasis which should be 
given to specific procedures designed 
to optimize the tracking situation. 

The four main effects and two inter- 
actions were significant in Exp. 2 
(Table 3). The three main effects, 
Ss, Sampling Rates, and Delay-no 
delay, account for 90% of the ex- 
plained variance. Again other sta- 
tistically significant factors each ac- 
count for relatively little variance. 

Only Sampling Rate and the Ss-by- 
Sessions (days) interaction were sig- 
nificant in Exp. 3 (Table 4). Sam- 
pling rates alone, in this experiment, 
account for over 90% of the explained 
variance.

One-tailed t tests were conducted 
for the hypothesis that the con- 
tinuous condition was not superior to 
each of the sampled conditions with 
optimum sensitivity in Exp. 1. The 
hypothesis was rejected at the .01
level of confidence or beyond in each
case except the 6-cps condition, which
was rejected at the .05 level. The
same null hypothesis for Exp. 3 was
rejected at the .01 level in each case
except the 5-cps condition. 
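A one-tailed comparison of this kind might be sketched as follows. The text does not state which form of t test was used, so the pooled-variance two-sample statistic here is an assumption, and the recovery times are invented for illustration. "Superior" is taken to mean a shorter recovery time, so the null hypothesis is rejected when the sampled mean exceeds the continuous mean by enough.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(sampled, continuous):
    """Pooled-variance t statistic for mean(sampled) - mean(continuous)."""
    n1, n2 = len(sampled), len(continuous)
    pooled_var = ((n1 - 1) * variance(sampled)
                  + (n2 - 1) * variance(continuous)) / (n1 + n2 - 2)
    std_error = sqrt(pooled_var * (1.0 / n1 + 1.0 / n2))
    return (mean(sampled) - mean(continuous)) / std_error, n1 + n2 - 2


# Hypothetical recovery times in seconds (not data from the experiments).
sampled_times = [1.9, 2.1, 2.4, 2.0, 2.2, 2.3]
continuous_times = [1.5, 1.6, 1.8, 1.4, 1.7, 1.6]

t_value, df = pooled_t(sampled_times, continuous_times)

# Tabulated one-tailed critical value of t at the .05 level for 10 df.
T_CRIT_05_DF10 = 1.812
reject_null = t_value > T_CRIT_05_DF10
print(f"t = {t_value:.2f}, df = {df}, reject: {reject_null}")
```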







Regression curves.—Regression
equations were fitted to the data
from each of the experiments. Plots
of mean recovery time in the first
two experiments and mean 50%
limits in the third experiment vs.
sampling rate indicated a linear
relationship on a log-log plot. Linear
regression equations and correlations
were calculated predicting tracking
performance (log recovery time or log
50% limits) from log sampling rate.*
These results are shown in Fig. 3, 4,
and 5. These correlations, of course,
indicate the goodness of fit of the
regression equations. All except the
correlation for Exp. 3 are significant
at the .05 level or beyond.
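The fitting procedure described in this paragraph can be sketched as an ordinary least-squares line through the logarithms of the data. This is a minimal illustration under stated assumptions: base-10 logarithms are assumed (the text does not give the base), the function name is hypothetical, and the data points are synthetic values generated from an exact power law rather than measurements from the experiments.

```python
from math import log10, sqrt


def fit_log_log(rates, performance):
    """Least-squares fit of log10(performance) on log10(rate).

    Returns (intercept, slope, r), where r is the product-moment
    correlation between the two logged variables.
    """
    x = [log10(s) for s in rates]
    y = [log10(p) for p in performance]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)
    return intercept, slope, r


# Synthetic example: recovery time = 4 * S ** -0.5 (illustrative only).
example_rates = [1.0, 2.0, 4.0, 8.0]
example_times = [4.0 * s ** -0.5 for s in example_rates]

intercept, slope, r = fit_log_log(example_rates, example_times)
print(f"log P = {intercept:.3f} + ({slope:.3f}) log S, r = {r:.3f}")
```

A negative slope corresponds to poorer performance (longer recovery time) at lower sampling rates.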


In Exp. 1 and 3, where a continuous
or nonsampled condition was studied,
the mean recovery time and mean
50% limits for this continuous condition
were used, with the appropriate
regression equation, to predict a
sampling rate which would give tracking
performance equivalent to continuous
sampling. These continuous-equivalent
sampling rates are shown with


*Similar relationships between tracking
performance and sampling period (T), i.e., time
between samples, could be plotted. These
relationships would be of the same mathematical
form as those with sampling rate (S) as may be
determined by substituting T = 1/S in the
regression equations.
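The substitution mentioned in this footnote can be written out explicitly. The symbols P (the performance measure), a (intercept), and b (slope) are introduced here only for illustration; the fitted values themselves are not given in this part of the text.

```latex
% P: tracking performance (recovery time or 50% limits)
% S: sampling rate, T = 1/S: sampling period
% a, b: intercept and slope of the fitted line (illustrative symbols)
\begin{align*}
\log P &= a + b \log S \\
\log S &= \log (1/T) = -\log T \\
\log P &= a - b \log T
\end{align*}
```

The relationship with sampling period is therefore again linear on a log-log plot, with the same intercept and a slope of opposite sign.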





Fig. 3. Experiment 1: regression of mean
recovery time on sampling rate.

Fig. 4. Experiment 2: regression of mean
recovery time on sampling rate.

Fig. 5. Experiment 3: regression of mean
50% limits on sampling rate.


the corresponding regression curves 
in Fig. 3, 4, and 5. 
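The prediction of a continuous-equivalent sampling rate amounts to solving the fitted regression line for the rate at which predicted performance equals the continuous-condition mean. The sketch below assumes a line of the form log P = a + b log S with base-10 logarithms. The function name and all numerical values are hypothetical and are not the fitted values from the experiments.

```python
from math import log10


def continuous_equivalent_rate(intercept, slope, continuous_performance):
    """Solve log10(P) = intercept + slope * log10(S) for S at P = P_cont."""
    if slope == 0:
        raise ValueError("slope must be nonzero to invert the regression")
    log_rate = (log10(continuous_performance) - intercept) / slope
    return 10.0 ** log_rate


# Hypothetical fit: recovery time = 4 * S ** -0.5, and a hypothetical
# continuous-condition mean recovery time of 1.0 sec.
example_intercept = log10(4.0)
example_slope = -0.5
equivalent_rate = continuous_equivalent_rate(
    example_intercept, example_slope, 1.0
)
print(f"continuous-equivalent sampling rate: {equivalent_rate:.1f} cps")
```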


Discussion 


Sampling rate.—The analysis of variance
results of the three experiments
consistently show the importance of
sampling rate as an influence on tracking
performance, the percentages of explained
variance attributable to this source
being 52, 33, and 92, respectively.

The regression equations serve to 
predict what particular rate would be 


equivalent to analog conditions for a 
given tracking situation. Thus, for a 
tracking situation involving targets 
which are discretely displaced from the 
cross hairs, such as studied in Exp. 1, 
the prediction is that a sampling rate of 
about 9 cps would be equivalent to 
continuous sampling tracking. In con- 
trast to the rather high continuous- 
equivalent sampling rate predicted for 
the discretely displaced target situation, 
this predicted rate for the tracking of 
constantly displaced targets is 5 cps. 

It is interesting to speculate concerning 
this difference in the predicted con- 
tinuous-equivalent sampling rates for 
the two types of tracking situations:
discretely displaced and constantly
displaced targets.

In the constantly displaced tracking 
situation, S “lets” the target move 
slowly off the cross hairs some small 
distance and then attempts to return it. 
Once the error is nulled, a correct dis- 
placement of the hand control will 
produce a signal equal in magnitude and 
opposite in direction to the disturbance 
rate. Such a hand-control displacement 
would keep the error nulled as long as the 
particular disturbance is active. To 
achieve such a situation, however, is a 
rather difficult task, and generally S 
must again eventually return the target 
which has “crept” out some distance
from the cross hairs. The tracking of 
such constantly displaced targets seems 
to consist, in part, of a series of tracking 
activities similar in nature to those per- 
formed with discretely displaced targets. 
This leaves the magnitude of the dis- 
turbances (the distance through which 
the target must be returned) as a possible 
explanation for the difference in con- 
tinuous-equivalent sampling rate. 

An analysis of variance of the initial
slope of the response curves for the last
session of Exp. 1 was carried out. Since
these curves indicate target position as a
function of time, the slope indicates rate
of movement of the target (as produced
by S), i.e., the utilized sensitivity for a
given response. This slope is not constant,
but is nearly so, initially. This
analysis of variance indicates that the
magnitude of the disturbance has a
significant influence on utilized
sensitivity. In particular, for the 2-in.
disturbances, the utilized sensitivities
are nearly twice as large, on the average,
as for the 1-in. disturbances. The
analysis of recovery times for Exp. 1
indicates that, for higher sampling rates,
higher sensitivities are optimum. Thus,
it may be reasoned that, for larger
disturbances, higher sensitivities and
higher sampling rates are required. The
discrete displacements were larger than
the displacements resulting from the
integration of rate of displacement
obtained with the constantly displaced
targets. Thus, higher sampling rates
for continuous equivalence would seem
to be required for the discretely displaced
targets.

Delay.—In Exp. 2, the delay-no delay
main effect accounts for nearly 20% of
the explained variance. The delay by
sampling rate interaction accounts for
about 3%. The effects are in the
predicted direction, with delay producing
considerable degradation of tracking
performance, and, in particular, more
degradation at the lower sampling rates
where the delay is longer. This fact
points to one possible explanation of the
effects of sampling rate, per se, on
tracking performance. As previously
pointed out, sampling itself introduces
delays varying from one to two sampling
periods, with a mean of one and one-half
sampling periods. Thus, the delay at a
lower rate is longer than at a higher rate,
and performance is presumably poorer.

Sensitivity.—The sensitivity main
effect in Exp. 1 is not significant. This
may be due to the fact that all three
sensitivities (“high,” “low,” “optimum”)
were chosen to be in the “usable” range,
and thus the principal effects of
sensitivity were eliminated before the
experiment began. The sensitivity by
sampling rate interaction was significant.
Examination of means for the various
sampling rates and sensitivities indicates
that this interaction is principally due to
the superiority of the “low” sensitivity
at 2 cps. Tracking under this condition
was better than under “optimum”
sensitivity, owing to the relatively crude
method of choosing these sensitivities
prior to the experiment.

Subjects.—The Ss’ main effects were
significant and contributed about 40%
of the sum of explained variance
components in the first two experiments,
but were not even significant in the third
experiment. This is probably due to
some difference in the task, which also
permitted learning in the first two
experiments (particularly in the first
three trials which were not included in
the analysis of variance) and no learning
in the third experiment. Subjects by
Sessions interaction was significant in
Exp. 1 and 3, as was the Sessions main
effect in Exp. 2. These effects reflect
decreased rather than increased
performance, and thus may be attributed
to decreased motivation, rather than to
effects of learning, per se.

Subjects by Sampling Rate was
significant in Exp. 1. Study of means
reveals that this was due to very poor
performance by some Ss at very low
sampling rates. Subjects by Sensitivities
was also significant in Exp. 1. This
effect was due to the fact that one S
performed better at the low sensitivities
than at optimum sensitivities. It is
difficult to ascribe these effects to any
factor controlled in these experiments.
In any case, it may be pointed out that
in the present instance, these interactions
contributed very small percentages of the
total explained variance.

Two third-order interactions were
significant. These interactions were:
(a) the Ss by Sampling Rates by
Sensitivities by Sessions in Exp. 1; and
(b) the Ss by Sampling Rates by Delays
by Sessions in Exp. 2. Interpretation
of these effects appears impossible.
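The sampling delay noted under Delay above (one to two sampling periods, with a mean of one and one-half) can be converted to seconds for any sampling rate. The sketch below assumes only that one sampling period equals 1/S sec. at a rate of S cps; the function name and the example rates are chosen for illustration.

```python
def sampling_delay_seconds(rate_cps):
    """Return (minimum, mean, maximum) sampling delay in seconds.

    Uses the delays stated in the text: one to two sampling periods,
    with a mean of one and one-half periods, where a period is 1 / rate.
    """
    if rate_cps <= 0:
        raise ValueError("sampling rate must be positive")
    period = 1.0 / rate_cps
    return 1.0 * period, 1.5 * period, 2.0 * period


# Illustrative rates only; compare a low and a higher sampling rate.
for example_rate in (2.0, 8.0):
    low, average, high = sampling_delay_seconds(example_rate)
    print(f"{example_rate:g} cps: {low:.3f} to {high:.3f} sec., "
          f"mean {average:.4f} sec.")
```

At 2 cps the mean delay is 0.75 sec., against 0.1875 sec. at 8 cps, which shows how much longer the delay is at the lower rate.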


SUMMARY 


In the situation where an operator closes a
control loop by his tracking behavior, the usual
analog loop may be altered by sampling of the
input to the operator or by sampling the
operator’s output. Three experiments, in which S's
output was sampled, established the importance
of sampling rate as a determinant of tracking
performance. A functional relationship (tracking
performance is proportional to the sampling
rate raised to some power) was established, such
that tracking performance is poorer at lower
sampling rates.

The variables of hand-control sensitivity and 
operator response-effect delays were found to 
affect sampled-data tracking performance. 


REFERENCES 


1. Battig, W. F., Gregg, L. W., Nagel, E. H.,
Small, A. M., Jr., & Brogden, W. J.
Tracking and frequency of target
intermittence. J. exp. Psychol., 1954, 47,
309-314.

2. Battig, W. F., Voss, J. F., & Brogden, W. J.
Effect of frequency of target intermittence
upon tracking. J. exp. Psychol., 1955,
49, 244-248.

3. Chapanis, A., Garner, W. R., & Morgan,
C. T. Applied experimental psychology.
New York: Wiley, 1949.

4. Katz, M. S., & Spragg, S. D. S. Tracking
performance as a function of frequency
of course illumination. J. Psychol., 1955,
40, 181-192.

5. Mood, A. M. Introduction to the theory of
statistics. New York: McGraw-Hill,
1950.

6. Rockway, M. R. The effect of variations
in control-display ratio and exponential
time delay on tracking performance.
USAF, WADC Tech. Rep., 1954, No.
54-618.

7. Walker, H. M., & Lev, J. Statistical
inference. New York: Holt, 1953.

8. Warrick, M. J. Effect of transmission-type
control lags on tracking accuracy.
USAF, WADC Tech. Rep., 1949, No.
5916.


(Received for early publication 
November 17, 1955) 












